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EPISOMAL VECTOR AND USES THEREOF 
Field of the Invention 
5 The invention relates in general to episomal vectors. 

Background of the Invention 

In lower organisms, such as prokaryotes and budding yeast, replication origins contain 

10 welldefined cis-sequences called "replicators" and interaction of these sequences with a specific 
initiator protein complex leads to the initiation of DNA synthesis in these cells (Jacob et ai, 1963, 
Stillman, 1994 and references therein; DePamphilis, 1993). Extrachromosomal replicators, 
generally, in addition to their origin function, encode functions that assure equal distribution of 
replicated molecules (i.e., partitioning) between daughter cells at cell division. For prokaryotic 

15 plasmids these partitioning functions are well studied and can be provided by several different 
mechanisms in bacterial cells (Nordstrom. 1990). In higher organisms, less is known about 
mechanisms for partitioning of extrachromosomal replicators. For artificial plasmids in yeast, 
chromosomal centromeres can provide this function. In metazoan cells, one well studied example 
of a stable extrachromosomal replicator exists - the latent origin oriP from Epstein-Barr Virus 

20 (EBV). The maintenance function of EBV requires the viral replication factor EBNA-1 and a 
series of binding sites for EBNA-1 termed the family of repeats (PR). A model that has been 
suggested for the function of the EBNA-l/FR combination is that EBNA-1 bound to PR provides 
physical retention of the oriP plasmids in the cell nucleus (Krysan et aL, 1989). 

Papillomaviruses are also capable of stable extrachromosomal replication. Infection and 

25 transformation of the cells by papillomaviruses follows single hit kinetics. (Dvoretzky et al, 
1980). Papillomavirus genomes are maintained as multicopy nuclear plasmids in transformed cells. 
The viral life-cycle can be viewed as three stages (Botchan et al, 1986). First, following initial 
entry, the papillomaviral genome is amplified in the cell nucleus, i.e., viral DNA is synthesized 
faster than chromosomal DNA and the copy-number is increased. The second stage represents 

30 maintenance of the viral DNA at a constant copy-number and latent phase of the viral infection is 
established. During the third, vegetative, stage of the viral life-cycle viral DNA amplification is 
initiated again, late proteins are synthesized and viral particles are assembled. 
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The El and E2 proteins are the only viral factors required for initiation of papillomavirus 
DNA replication (Ustav and Stenlund 1991; Ustav et ai, 1991; Yang et ciL^ 1991; Chiang ei aL, 
1992; Kuo etal, 1994). A similar, if not identical, set of cellular replication factors and enzymes, 
in addition to viral initiator proteins, is utilized by SV40 (Tsurimoto et ai, 1990, Weinberg et aL, 
5 1990) and BPV-1 ( MuUer et ai, 1994 ) at the origin of replication to initiate DNA synthesis. 
Analysis of the essential cis-sequences shows that the BPV-1 minimal origin (Ustav et ai, 1993) 
resembles a typical eukaryotic origin of replication (DePamphilis, 1993) and it has been suggested 
that this similarity could also be extended to the mechanisms of replication of all papovaviruses 
(Nallaseth and DePamphilis, 1994; Bonne- Andrea et ai, 1995). However, the ability of the 
10 papillomaviruses to persist as plasmids distinguishes papillomaviruses from other papovaviruses. 
It has been known for more than 10 years that BPV-1 replicates in transformed cells as a 
multicopy nuclear plasmid, which can persist in the tissue culture cells over long periods of time 
(Law et ai, 198 1) This indicates that papillomaviruses have efficient mechanisms for 
segregation, i.e., control of copy-number and partitioning, in the transformed cells. 
15 . The role of viral factors, ds-acting sequences and cellular factors in long-term persistence 

of papillomaviruses, which relates to the segregation functions of viral DNA, is not clearly 
understood. That is, the regions of the viral genome which specify copy number are not identified 
in the prior art; nor are the regions of the viral genome which participate with the host cell to 
ensure proper segregation of the viral genome during partitioning. Much more is understood with 
20 respect to the initial amplification stage of the papillomavirus life-cycle. 

Bovine Papillomavirus (BPV) and Human Papillomaviruses (HPVs) persist as stably 
maintained plasmids in mammalian cells. Transient assays, i.e., on the order of several hours to 3- 
4 days, have been used to define the minimal origin of replication (MO) which is required for 
transient replication in BPV (Ustav et al., EMBO J, 10, 4231-4329, 199 1) and for several HPV 
25 subtypes. Two trans-acting factors encoded by BPV and HPVs, namely El and E2, have been 
identified in transient assays which are necessary to mediate replication in many cell types via MO 
(Ustav et al., EMBO J., 10, 449-457 (1991); Ustav et aL, EMBO J, 10, 4231-4329, (1991); 
Ustav et ai, PNAS, 90, 898-902 (1993).) El and E2 fi-om BPV will replicate via the BPV MO 
and via the MO of many HPV subtypes. (Chiang et ai, PNAS, 89, 5799-5803 (1992). El and E2 
30 from HPV will replicate via 

the BPV MO and via the MO of many HPV subtypes. (Chiang et al., PNAS, 89, 5799-5803 
(1992). Replication of plasmids containing the above elements is high level but transient in 
eukaryotic cells. Plasmid loss is rapid in the presence and absence of selective pressure. 



BNSDOCID:<WO 9724451 A2> 



SUBSTITUTE SHEET (RULE 26) 



wo 97/24451 




PCT/EE96/00004 



The papillomaviais life cycle has been the subject of much research. DifFerent ponions of 
the viral genome have been tested in short-term, i.e., transient, transcription or replication assays. 
See. for example, Szymanski et ai, 1991, Jour. Virol. 1 1:5710; Vande Pol et ai, 1990, Jour. 
Virol 64:5420; Sowden et aL^ 1989, Nucl. Acids Res. 17:2959, Stenlund, 1987, Science 
5 236:1666; Sedman e/ a/., 1995, Eur. Jour Mol. Biol. 14:6218; Haugen et aL^ 1988, Eur. Jour. 
MoL Biol. 7:4245; and Kuo etal, 1994, Jour. Biol. Chem. 269:24058. 

The BPV 69% transforming region has been used to introduce the rat preproinsuiin gene 
into mouse cells (Sarver e/a/., 1981, Mol. Cell, Biol. 6:486). 

The PMSl and PMS2 regions of BPV have been reported to "independently support" 

10 extrachromosomal replication of the Tn5 neomycin gene in cells that provide viral factors in trans 
PMS-1 (plasmid maintenance sequence- 1) is localized within a 521 bp region mapping at 
positions 6945-7476 of the BPV genome, and PMS-2 has been localized to a 140 bp region 
within the putative open reading frame for the El protein, which maps at positions 1515-1655 of 
the BPV genome. It has been reported that recombinant plasmids carrying either of the PMS 

15 elements are unrearranged and stably maintained at a constant copy number. In addition. El, E6 
and E7 are identified as candidate factors for trans regulation of the plasmid state See Lusky et 
al, 1984, Cell 36:391, and Lusky a/., 1986, Jour. Virol. 11:729. 

Woo et al, W094/12629 report a vector containing a papilloma virus origin of replication, 
the "vector maintenance sequence" described in Lusky et ai^ 1984, supra, a therapeutic nucleic 

20 acid, and an E2 gene sequence or an El /E2 chimeric gene. Woo et ai suggest that such a vector 
may be tested for stable episomal maintenance over a period of 2-30 days post-transfection. The 
"vector maintenance sequence" of Lusky et ai, 1984, which is described in Woo et ai, is shown 
herein' not to be capable of providing long-term vector persistence. 

Mutations in the E2 gene have a pleiotropic effect on viral gene fimctions, including 

25 oncogenic transformation. These effects may be the result of the requirement for E2 expression to 
regulate viral transcription (see DiMaio and Neary, 1989, The Genetics of bovine papillomavirus 
type 1 papillomaviruses and human cancer. (Ed. N. Pfister), CRC Press, Boca Raton, FL). The 
BPV-1 E2 protein has been shown to activate viral enhancers in trans (Spalholz et ai, 1985, Cell 
42:183). The E2 open reading frame has been shown to encode a site-specific DNA binding 

30 protein that can bind to several sites within the E2 responsive enhancers 1 and 2 (Androphy et ai, 
1987, Nature 325:70; Moskaluk et ai, 1987, Proc. Nat. Aca. Sci. 84:1215). E2 recognirion sites 
that have been studied to date include the sequence motif ACCN6GGT, where N is any 
nucleotide (Hawley-Nelson et ai, 1988, Eur. Jour. Mol. Biol. 7:525; Hirochika et ai, 1988, 
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Genes Dev. 2:54, McBride, 1988, Eur. Jour. Mol Biol. 7:553; Moskaluk et a/., 1988a, Pre. Nat. 
Aca. Sci. 85:1826), and it is suggested that E2 binds this palindrome as a dimer (Dostani ei ai, 
1988, Eur. Mol. Biol. Org. Jour. 7:3 807; McBride a/., 1989, Proc. Nat. Aca. Sci. 86:510). Not 
all of these sites appear to bind E2 with the same strength. Sites having the motif ACCGN4CGGT 
5 appear to bind better than sites that deviate in the fourth and ninth bases (Hawley-Nelson et ai, 
1988, supra; Moskaluk et aL, 1988b, supra). It also appears that some of the target sites for the 
protein have different capabilities for activation in vivo (Harrison e( aL, 1987, Nucl. Acids. Res. 
15:10267; Haugen et al,, 1987, Eur. Jour. Mol. Biol. 6:145; Spalholz et al,, 1987, Jour. Virol. 
61:2128). Li ei ai (1989, Genes & Develop. 510) analyze 17 E2 binding sites in the BPV- I 
10 genome and show that affinities for E2 vary over a 300-fold range. Li et ai also find that the 
presence of the conserved consensus ACCGN4CGGT did not necessarily guarantee that the 
binding site would be stronger than one with a deviant base, and suggest that the presence of this 
palindrome is not a sufficient parameter for predicting the strength of a binding site. 

A truncated form of E2 protein exists which is defective in transcriptional activation and 
1 5 competent in viral replication. 

Dowhanick et ai, 1995, Jour. Virol. 69:7791, describe an E2 deletion mutant containing 
residues 1-218 of the protein which is said to retain a DNA replication function, but is defective in 
transcriptional trans-activation. Also described are several E2 point mutants (331 and 344) which 
are defective in both transcriptional transactivation and DNA binding. 
20 In addition, subsequent to Applicant's disclosure of E2 point mutants which are defective 

in transcriptional activation and replication competent in the subject priority document, which E2 
mutants are also described in Abroi eiaL, 1996, Jour. Virol. 70:6169, additional similar, if not 
identical in some instances, E2 point mutants have been identified. Ferguson and Botchan, 1996, 
Jour. Virol. 70:4193, describe mutations at amino acids 73 and 74 which retained replication 
25 function but failed to activate transcription". Sakai et ai, 1996, Jour. Virol. 70:1602, describe 
three point mutants (R37A, 173 A, and W92A) in HPV defective for transcriptional activation but 
retaining wild type DNA replication activity in transient assays. 

One object of the invention is to provide a recombinant vector which, by virtue of the 
sequences it contains, is stably maintained and thus persists long-term in mammalian cells. 
30 Another object of the invention is to provide a recombinant episomal vector which is 

stabilized via regulatory sequences which are contained within a relatively small amount of DNA 

Another object of the invention is to provide a cis-acting element which confers long-term 
stability to a transiently replicating eukaryotic episomal plasmid. 
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Yet another object of the invention is to provide an episomal genetic element which 
replicates independently of the host cell chromosomal DNA, and is therefore not dependent upon 
regulatory control of replication by the host cell genome. 

Another object of the invention is to provide stable and reliable plasmid copy number in 
5 both Gl and G2 stages of the cell cycle. 

Yet another object of the invention is to provide a recombinant vector which is stably 
maintained at a constant copy number for multiple cell generations. 

Another object of the invention is to provide a recombinant vector which is able to persist 
over a long time period in eukaryotic, particularly mammalian cells, from which can be expressed 
10 a therapeutic, prophylactic, or marker gene. 

Another object of the invention is to provide a recombinant vector which is not restricted 
as to its ability to be maintained in a given cell type, but which is stably maintained in any one of 
numerous diverse mammalian cell types. 

Another object of the invention is to provide a recombinant vector containing sequences of 
15 viral origin which do not confer oncogenic properties to the transfected host cell, and is therefore 
safe to use in humans. 

Summary of the Invention 

20 The invention is based on the discovery of a vector system which permits long-term 

persistence in episomal form in a mammalian cell, and in particular to the discovery of a 
minichromosomal maintenance element, which element confers stable persistence of 
extrachromosomal (i.e,, episomal) DNA in mammalian host cells. 

The invention encompasses a method of obtaining long-term stable production of a gene 

25 product of interest in a host cell, comprising providing a host cell containing a vector comprising 
(A) a minimal origin of replication of a papilloma virus, (B) a minichromosomal maintenance 
element of a papilloma virus, and (C) a gene encoding the gene product, wherein the vector, when 
present in a mammalian host cell, persists in the cell for at least about 50 cell generations in 
dividing cells or for at least about 8 weeks in non-dividing cells under nonselective conditions 

30 without an appreciable loss of copy number. 

The invention also encompasses a method of obtaining long-term stable production of a 
gene product of interest in a host cell, comprising providing a host cell containing a vector 
comprising papillomavirus sequences consisting essentially of (A) a papillomavirus E2 gene, (B) a 
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minimal origin of replication of a papilloma vinis,(C) a minichromosomai maintenance element of 
a papilloma viais, and (D) a gene encoding the gene product, wherein the vector persists in the 
cell for at least about 50 cell generations in dividing cells or for at least about 8 weeks in non- 
dividing cells under nonselective conditions without an appreciable loss of copy number. 
5 The invention also encompasses a method of obtaining long-term stable production of a 

gene product of interest in a host cell, comprising providing a host cell containing a pair of 
vectors comprising (I) a first vector comprising papillomavirus sequences consisting essentially of 
(A) a papillomavirus E2 gene, (B) a minimal origin of replication of a papilloma virus, and (C) a 
minichromosomai maintenance element of a papilloma virus, and (II) a second vector comprising 

10 papillomavirus sequences consisting essentially of (A) a gene encoding the gene product, (B) a 
minimal origin of replication of a papilloma virus, and (C) a minichromosomai maintenance 
element of a papilloma virus, wherein the vector persists in the cell for at least about 50 cell 
generations in dividing cells or for at least about 8 weeks in non-dividing cells under nonselective 
conditions without an appreciable loss of copy number. 

15 The invention also encompasses use of a recombinant vector for obtaining long term stable 

maintenance of erogenous DNA in a eukaryotic host cell wherein the recombinant vector 
comprises: a minimal origin of replication of a papillomavirus; a minichromosomai maintenance 
element of a papillomavirus; and a heterologous DNA sequence encoding an expressible gene. 

Preferably, the time period over which the vector persists in the host cell under 

20 nonselective conditions without an appreciable loss of copy number is 6 weeks, and most 
preferably 8 weeks or 12 weeks or longer, or in terms of cell generations, 100 or 120 cell 
generations or longer. 

According to the claimed methods, long-term persistent vectors will include one in which 

the minichromosomai maintenance element consists essentially of the region of BP V mapping to 
25 positions 7590 to 7673; or wherein the minichromosomai maintenance element comprises (BPV 

E2 binding sites 6, 7 and 8) x, wherein x is 3 to 6 or wherein the minichromosomai maintenance 

element comprises at least 2 of the 3 E2 binding sites 6, 7 and 8. 

The invention therefore also encompasses a recombinant vector for stable long-term 

persistence of erogenous DNA in a mammalian host cell, the vector comprising a minimal origin 
30 (MO) of replication of a papillomavirus, a minichromosomai maintenance element (MME) of a 

papillomavirus, and a gene encoding a gene product of interest, wherein the vector is defined 

hereinbelow (1-4). 
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1. The vector comprising papilloma virus sequences consisting essentially of (A) a 
minimal origin of replication of a papilloma virus, (B) a minichromosomal maintenance element of 
a papilloma virus consisting essentially of at least two of the three E2 binding sites 6, 7, and 8, 
wherein the region of the vector comprising the minimal origin of replication and 

5 minichromosomal maintenance element consists of a DNA sequence different from the natural 
papilloma .virus sequence, and wherein the vector, when present in a mammalian host cell which 
expresses El and E2, persists in the cell for at least about 50 cell generations in dividing cells or 
for at least about 8 weeks in non-dividing cells under nonselective conditions without an 
appreciable loss of copy 

1 0 number. 

2. The vector comprising papilloma virus sequences consisting essentially of (A) a 
minimal origin of replication of a papilloma virus, and (B) a minichromosomal maintenance 
element of a papilloma virus consisting essentially of multiple E2 binding sites, wherein the 

15 distance between the minimal origin of replication and the minichromosomal maintenance element 
is less than about 1.0 kb, wherein the vector, when present in a mammalian host cell which 
expresses El and E2, persists in the cell for at least about 50 cell generations in dividing cells or 
for at least about 8 weeks in non-dividing cells under nonselective conditions without an 
appreciable loss of copy number. 

20 

3. The vector comprising papilloma virus sequences consisting essentially of (A) a 
minimal origin of replication of a papilloma virus, (B) a minichromosomal maintenance element of 
a papilloma virus consisting essentially of the region of BPV mapping to about positions 7590- 
7673 wherein the vector, when present in a mammalian host cell which expresses El and E2, 

25 persists in the cell for at least about 50 cell generations in dividing cells or for at least about 8 
weeks in nondividing cells under nonselective conditions without an appreciable loss of copy 
number. 

4. The vector comprising papilloma virus sequences consisting essentially of (A) a 
minimal origin of replication of a papilloma virus, and (B) a minichromosomal maintenance 

30 element of a papilloma virus consisting essentially of (BPV E2 binding sites 6, 7, and 8)x wherein 
X is 3-6, wherein the vector, when present in a mammalian host cell which expresses El and E2, 
persists in the cell for at least about 50 cell generations in dividing cells or for at least about 8 
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weeks in nondividing cells under nonselective conditions without an appreciable loss of copy 
number. 

As used herein, the term "consisting essentially of means that, with respect to 
papillomavirus sequences, those sequences which are both necessary and suflficient for long-term 
5 vector persistence without an appreciable loss of copy number. 

In a preferred embodiment of the invention, a vector of the invention will comprise 
papillomavirus sequences as well as other sequences relating to expression of a gene of interest. 
The papillomavirus sequences in the vector will preferably consist essentially of a papillomavirus 
MME and MO, and thus will not contain papillomavirus sequences that are not required for long- 
10 term stable persistence in a eukaryotic host celL The vectors thus advantageously do not contain 
papillomavirus sequences which are not both necessary and suflficient for long-term persistence in 
the episomal state. In addition, the vectors do not contain oncogenic sequences which are present 
in the papillomavirus genome. 

In preferred embodiments, the minichromosomal maintenance element of a papillomavirus 
15 is from BPV; the minimal origin of replication of papillomavirus is from BPV; the papillomavirus 
El protein is from BPV; the papillomavirus E2 protein is from BPV. 

Preferably, the vector further comprises a gene or genes encoding papillomavirus E2 
and/or El proteins, and the E2 gene most preferably encodes a mutant form of E2 which is a 
point mutant that is replication competent but defect in transcriptional activation As used herein, 
20 a "point mutant" may refer to either a single amino acid change, or several individual amino acid 
changes (2, 3, 4 etc.) which together confer the desired phenotype. A point mutant may be an 
amino acid substitution or a deletion or insertion. 

One panicularly useful form of a vector of the invention is a recombinant vector or vector 
system for stable persistence of erogenous DNA in a host cell, the vector comprising a minimal 
25 origin of replication of a papillomavirus, a minichromosomal maintenance element of a 
papillomavirus, and one or both of the papillomavirus El and E2 genes. 

The invention also encompasses a mutant form of a papillomavirus E2 protein wherein the 
replication function of the protein is competent and the transcriptional activation function of the 
protein is defective, wherein the mutant form of E2 protein differs from the wild-type E2 in a 
30 nucleotide point mutation which translates into an amino acid substitution. 

Preferred E2 point mutants are mutated in an alpha helical domain, for example in alpha 
helix 2 or 3, as defined herein 
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Additional preferred E2 point mutants useful according to the invention are R37A, E74A, 
and D 122A and D143A/R172C. 

A particularly striking feature of the invention is that the stable vectors of the invention are 
not restricted to the host cell specificity of papillomavirus: This release from the natural 

5 papillomavirus host cell type restriction has been achieved by removing key genetic elements of 
the papillomavirus genome from their native context; for example, expression of the 
papillomavirus genes encoding El and E2 proteins is not controlled by the promoters that are 
native to these genes, but rather the El and E2 genes are placed under the control of non-native, 
i.e., heterologous promoters, which are either functional in a broad range of mammalian cells or 

10 tissues or are cell- or tissue-specific. 

It is preferred according to the invention that the expressible papillomavirus gene encoding 
El or E2 include a structural gene encoding El or E2 operatively associated with regulatory 
sequences for expression of the structural gene in a host cell. Such regulatory sequences will 
include a promoter and /or may optionally include an enhancer. The promoter is preferably a 

15 promoter that is non-native (i.e., heterologous) to the El or E2 structural gene. The promoter is 
may be functional in more than a single tissue type, i.e., one that is able to initiate transcription in 
a broad range of tissue types, and therefore unrestricted with respect to its tissue specificity. 
Alternatively, the promoter may be functionally restricted to a single tissue type, and therefore 
tissue-specific. 

20 As used herein, tissue-specific and cell-type-restricted both refer to wherein a prompter is 

operable substantially in the same tissue-type or cell-type. 

Preferred promoters comprise one of the thymidine kinase promoter and a strong 

promoter such as the SRalpha promoter. It is expected that a vector of the invention which 

includes tissue-specific regulatory elements operatively associated with the El and/or E2 genes 
25 will be capable of long-term persistence only in those cell types in which the regulatory elements 

are functional. 

In its most useful form, a recombinant vector of the invention will include an expressible 
gene of interest. 

A vector of the invention which contains an expressible gene of interest contains not only 
30 a structural gene encoding a protein or RNA of interest, but also is operatively associated with 
regulatory sequences for expression of the structural gene in a host cell. Such regulatory 
sequences may include not only a promoter, but also additional regulatory sequences such as an 
enhancer, splice sites, and poly-adenylation sequences. These regulatory elements that control 
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expression of the staiciural gene promoter may be regulatory elements that are native to the 
structural gene (i.e., the control sequences that are naturally associated with these genes in their 
native environment), or they may be non-native to the structural gene, and therefore heterologous 
regulatory elements. These elements, particularly the promoter, may be functional in more than a 
5 single tissue type or may be functionally restricted to a single tissue. 

It is expected that a vector of the invention which includes a tissue-specific regulatory 
element that is operatively associated with a structural gene of interest will express that gene of 
interest only in those host cell types in which the regulatory elements are functional (i.e., specific). 
It is preferred according to the invention that the host cell type-restricted expression (i.e., 

10 tissue-specificity) of the structural gene of interest be coordinated with the tissue-specificity of the 
regulatory elements operatively associated with the El and E2 genes. That is, one may envision 
that the tissue-specificity of El, E2, and structural gene of interest is the same. Alternatively, the 
tissue-specificity of El and E2 gene expression may be broader than the tissue-specificity of 
expression of the gene of interest, resulting in a broad host cell type range for long-term 

1 5 persistence of a vector of the invention, and a more limited host cell type range for expression of 
the gene of interest. Alternatively, the tissue-specificity of El and E2 gene expression may be 
quite limited (for example, to a single cell type), and the tissue-specificity of expression of the 
gene of interest broad or unlimited, resulting in a limited host cell type in which a vector of the 
invention can persist long-term, which in turn is the limiting factor in determining the type of host 

20 cell in which the gene of interest is expressed. 

In another preferred embodiment of a vector of the invention, the vector also includes a 
bacterial host cell origin of replication and a gene encoding a selectable marker for preparation of 
vector DNA in a bacterial host cell. 

The invention also features host cells containing the vectors herein described, such host 

25 cells being most preferably being eukaryotic, and of mammalian origin, such as of muscle, gut, or 
brain origin. 

The invention also features a method of manufacture of a vector, which method includes 
culturing a host cell containing a vector described herein. It is panicularly preferred that such 
manufacture occur in a lower eukaryotic cells, e.g., yeast or insect, or prokaryotic cells, e.g., 
30 bacterial cells such as E.coli or Salmonella. Therefor, the vector will further include an origin of 
replication of yeast, insect or bacterial origin, and one or more genes encoding a selectable 
marker, e.g., a gene encoding kanamycin resistance, for selection of cells containing the vector.' 
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The invention also features a method of obtaining stable expression of a gene of interest in 
a cell, comprising providing a host cell as described above. 

The invention also features methods of treating a disease stemming from a genetic defect, 
comprising administering a therapeutically effective or a prophylactic amount the vector of the 
5 invention to a patient afflicted with the disease. 

The invention also includes use of a recombinant vector of the invention in the treatment 
of a disease. 

The invention also encompasses a gene delivery system comprising a vector of the 
invention in combination with a gene delivery vehicle, which may be of viral or non-viral origin. 
10 The invention also encompasses a method of producing a protein or RNA of interest in a 

host cell or a transgenic animal, comprising culturing a host cell under conditions which permit 
production of the protein of interest, or providing a transgenic animal which produces the protein, 
as described herein. 

The invention also encompasses a mammalian model of disease, for screening of drugs to 
15 treat the disease or. for testing of therapeutic or prophylactic regimes, the mammalian model 
comprising a transgenic animal whose cells contain a vector of the invention. 

The invention also encompasses a transgenic animal containing an episomal vector as 
described herein, the vector encoding a protein of interest. 

As used herein, a "transgenic animal" refers to an animal, preferably a mammal, which 
20 contains in some, but not necessarily all, of its cells an episomal vector, as described herein. 

The invention also features kits for providing stable persistence of a vector in a host cell, 
the kit comprising a vector or a host cell as described herein and packaging materials therefore. 

A kit of the invention may also include a mutant E2 protein as described herein or a gene 
encoding this protein, wherein the E2 mutant is thus provided for stable persistence of a vector in 
25 a host cell. 

Uses and advantages of the invention are as foilows. 

The invention is useful in in vivo and ex vivo human gene therapy where correction of 
30 inherited or acquired genetic defects is desired. The invention also is useful in vaccination 
protocols where resistance or immunity to infectious pathogens, such as HIV, Hepatitis C Virus, 
Hepatitis B virus. Herpes virus, parasitic pathogens such as Tuberculosis and Leishmaniasis, and 
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protozoans such as ameobic dysentery, is desired, or the elimination or induced quiescence of 
aberrant cells, such as cancer cells, is considered beneficial. 

Recombinant vectors of the invention are usefijl in that they permit persistent expression 
of a therapeutic gene in both dividing and non-dividing cells; for example, in differentiated cells, 
5 such as those in brain, and muscle. 

Recombinant vectors of the invention are also useful for high level transient expression in 
cells where desired, such as for cancer therapy or in vivo vaccination. 

Both in vivo and ex vivo gene therapy strategies are possible with this vector system, 
including stable, multicopy gene maintenance and expression, in haemopoietic and other stem 
1 0 cells, and in the committed and differentiated progeny of these cell types. 

For human gene therapy, uses of the recombinant vectors of the invention are not limited 
in terms of delivery of the vector to a cell. That is, vectors of the invention may be delivered to a 
cell via non-viral or viral delivery systems. Delivery systems of non-viral origin include those 
which employ cationic liposomes, where vector size constraints do not limit the nature and 
1 5 number of plasmid vector components. Delivery systems of viral origin include viral particle- 
producing packaging cell lines as transfection recipients for the above E1/E2/M0/MIVIE- 
containing plasmids into which viral packaging signals have been engineered, such as those of 
adenovirus, herpes viruses and papovaviruses. 

Recombinant vectors of the invention also are useful in transgenesis, including production 
20 of transgenic animals via pronuclear injection, or embryonic stem cell transfection and embryo 
chimera generation. 

Recombinant vectors of the invention also are useful for preparation of cell factories for 
stable, high level expression of proteins of therapeutic value in cultured mammalian cells. 

Further features and advantages of the invention will become more fully apparent in the 
25 following description of the embodiments and drawings thereof and from the appended claims. 

Description of the Drawings 

Before describing the invention in detail, the drawings will be briefly described. 
30 Figure 1 A is a scheme of the experimental protocol. 

Figure IB is a short term replication assay for the plasmids in the CH04.15 cells. Low 
molecular weight DNA was extracted from the CH04.15 cells transfected with the plasmids 
contairung minimal origin or 2.5 kb Bglll fragment and analyzed by Southern blotting after 
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digestion with Dpnl and linearizing enzyme Xbal (lanes 1, 2) and Hindlll (lanes 3, 4). Lanes 1,2- 
episomally replicating minimal origin containing plasmid DNA extracted 36 and 60 hours after 
transfection and lanes 3,4 - episomal DNA extracted 36 and 60 hours after transfection with the 
plasmid containing 2.5 kb origin fragment. 
5 Figure 2 A stable replication of the BPV-1 origin containing plasmids in the CH04. 15 cell 

line. Representation of the BPV-1 fragment (BPV nucleotides 6945-1515) used in this 
experiment. The respective mutants in this fragment are depicted and are further described in 
Materials and Methods. The following genetic elements are indicated: NCOR-1 and 2 - Negative 
Control Of Replication 1 and 2; PMS-1 - Plasmid Maintenance Sequence; ORI - minimal origin of 

10 replication; E2RE1 and E2RE2 - E2 responsive enhancer 1 and 2; CE- constitutive enhancer; El, 
E6, E7 and E8 -respective ORF-s; P7175, Pl. Psg, Pgw - respective promoters; boxes indicate 
location of 14 E2 binding sites in this fragment. 

Figure 2B is a southern blot analysis of stable cell lines for the presence of episomal 
plasmids. The marker lanes 1, 2, 3 - contain 100 pages of linearized vector pNeo5', pNeoBgl40 

15 and pNeoBglll, respectively. Low molecular weight DNA was extracted from CH04.15 cells 
after transfection and G418 selection (see Materials and Methods for details). The plasmids used 
were the vector plasmid pNeo5' (lane 5), pNeoXhoI->HpaI with disrupted El protein binding site 
(lane 6), pNeoBgI40 (lane 7), pNeoANCORI with deleted Negative Control Region (lane 8), 
mutants Sma", 775 and 576 with disrupted 5'-part of El, E6 and E6/E7 ORFs respectively (lanes 

20 9-11), the wild type pNeoBglll (lane 12), and the minimal origin containing plasmid pTSeoMO 
(lane 13). 

Figure 3 A shows stable extrachromosomal replication of the plasmids with deletions in the 
URR in CH04.15 cells. Representation of BPV- I fragments with respective deletions.P7i75, P^, 
P7940 - respective promoters in this ft-agment. E2REI and E2RE2 - E2 responsive enhancer, CE 
25 constitutive enhancer, boxes indicate E2 protein binding sites. End points of the respective 
deletions are given in Materials and Methods. Circle indicates the location of the minimal origin. 
Ability of respective deletion mutants to function in long term replication assay is indicated by (+) 
or (.). 

Figure 3B. Low molecular weight DNA was extracted .from the cells, transfected with 
30 respective plasmids and selected for G418, digested with the linearizing enzyme Hind III and 
analyzed by Southern blot (lanes 1-22). Lanes 23, 24 and 25 contain 100 pg of linearized pNeo5' 
vector, pNeoBgl40 and PNeoBglll marker DNA. 
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Figure 4 A represents an analysis of state and copy-number of BPV-1 origin containing 
plasmids in CH04.15 cells. Copy-number measurement of BPV plasmids stably replicating in 
CH04. 15 cells. Total DNA was extracted from the stable cell lines and subjected for linearization 
with Hindlll. Lanes 1-3 represent analysis of 2 |ig of total DNA from three independent cell-lines, 
5 a series of plasmid dilutions for copy-number reconstruction is included in lanes 4-6. 

Figure 4B shows plasmid DNA, wherein a total of DNA 2 |ag, cut with plasmid noncutter 
Apal, from four independent cell-lines is analyzed (lanes 1-4). Respective marker of uncut plasmid 
DNA is shown in lanes 5 and 6. 

Figure 5 shows restoration of stable replication of the plasmids by an MME consisting of 
10 oligomerized E2 binding sites. Low molecular weight DNA was extracted from the G418 
resistant cells and linearized with Hindlll (lanes 1-4) or with Xbal (lane 5). Lane 1 represents 
analysis of the DNA from the cells transfected with original D234/221 mutant with deleted MME. 
Insertion mutants with E2REI (BPV nucleotides 7611-7673) cloned into D234/221 adding back 
18 and 9 E2 binding sites restored stable replication of the plasmid (lanes 2,3). Mutant with 
15 oligomerized 10 E2 binding sites cloned into D234/221 (lane 4) and mutant with deleted PMSl 
with inserted 10 oligomerized E2 binding sites (lane 5) replicate in the long term assay. Lanes 6-8 
contain 100 ng of linearized respective marker DNA. 

Figure 6 shows the transformation efficiency (colonies per 100.000 cells) for G418 
resistance of the plasmids pNeoBgl40 (Bgl40), pNeoXhol-»HpaI (HpaXho), pNeoMO (MO) and 
20 vector pNeoS'. 500 ng of the respective plasmid DNA was electroporated into the cells and 84 
hours later cells were trypsinized, counted and plated. Complete medium containing 450 ng/ml of 
G418 was used for selection. Colonies were counted at the 10th day of selection Presented 
values are average of three independent measurements, A. CH04 J5 cells, B. CH0212 cells and 
C. CH049 cells. 

25 Figure 7A is a schematic illustration of a plasmid in which El expression is under the 

control of the SR promoter. 

Figure 7B is a schematic illustration of a plasmid in which El expression is under the 
control of the Thymidine kinase promoter. 

Figure 7C is a schematic representation of the plasmid maps shown in Figs. 7 A and 7B. 
30 Figure 8 represents a comparison of the stable replication modes of BPV origin containing 

plasmid and chromosomal DNA in the CH04.15 cells. BrdU labeling of the CH04.15 cells 
carrying stably replicating pNeoBgl40 was done for a) and e) - 3.5, b) and f) - 9.5, c) and g) - 
15.0 and d) and h) - 24 hours, respectively. Episomal - a), b), c), d) and total chromosomal DNA 
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- e), f)» g). h) were prepared at respective time points and analyzed as described in Materials and 
Methods. CsCl gradients were aliqoted, denatured, renatured and loaded onto the nylon fiUers by 
slot-blotter and hybridized with radioactive BPV-1 origin probe for episomal DNA and with 
radioactive total CHO DNA for genomic DNA gradients. Intensity of hybridization was 
5 quantitated by use of the Phosphoimager. 

Figure 9 shows the copy number of BPV-1 origin-bearing plasmids is similar in Gl and G2 
phases of the cell cycle. Southern blot analysis of plasmid copy number in Gl and G2 phases of 
the cell cycle. Derivatives of CH04.15 cell line with stably, extra chromosomally replicating BPV- 
1 origin-containing plasmids were arrested in Gl/S with mimosine or hydroxyurea and in G2 with 

10 hydroxyurea, followed by Hoecsht 33342 treatment. Arrest was verified with FACS analysis, total 
DNA was extracted and an equal amount of DNA was loaded onto each lane. Analysis of four 
different established cell lines is shown on the figure with 3 parallels for each cell cycle arrest. 

Figure 10 shows transient replication of pSR alpha and pTk vectors (Figure 7) in CHO 
(lane 1-4) and C333A (lanes 7-10) cell lines. 2 micrograms of respective plasmid was transfected 

15 into the cells by electroporation, time points were taken 36 and 48 hours post-transfection and 
low-molecular weight DNA was extracted, digested with Dpnl and Hindlll and analyzed by 
Southern blotting. Lanes 1-2 and 7-8 represent transient replication of SR alpha vector in CHO 
and C33A cells respectively. Over-replication is clearly visible in case of both cell lines. Lanes 3-4 
and 9-10 represent transient replication of TK vectors in CHO and C33A cells respectively. Some 

20 over-replication is detectable in C33A cells, whereas replication of TK vector in CHO cells is 
weak, but without smear. The latter is characteristic for onion-skin type over-replication. Lanes 5- 
6 and 1 1 represent respective molecular-weight markers. 

Figure 1 1 shows beta-galactosidase activity in CHO cells transfected with equimolar 
amounts of vector plasmids. In plasmids pON260 beta-galactosidase gene is expressed from 

25 immediate early CMV promoter. In other vector constructs beta-galactosidase is expressed from 
RSVLTR 

Figure 12 shows beta-galactosidase assay for the cells transfected with the pSR alpha and 
pTK constructs after 56 hours. In the case of TK constructs formation of lacZ- positive blue 
colonies could be detected, however in the case of SRalpha constructs only single, strongly 
30 stained, blue cells could be detected in the culture indicating that plasmids are not inherited to 
each of the daughter cells during cell division. 

Figure 13 A shows a number of colonies per 10000 cells transfected with equal molar 
amounts of vectors selected for G418. 
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Figure 13B is an analysis of the episomal DNA in the cells transfected and selected for 
G4 18 with the TK-El constructs. 

Figure 14 A is a schematic representation of the designed E2 point mutations and chimeric 
E2 proteins. 

5 Figure I4B shows an immunoblots of the E2 proteins. 

Figure 15A shows the transient replication properties of the mutated E2 proteins in CHO 

cells. 

Figure 15B shows the structure of the reporter plasmids used. The numbers indicate 
nucleotide positions in the BPV URR sequence. pUCAlu was used for transient replication 
10 studies. pPCAT and pSV3BS9CAT are the CAT reporter plasmids used in transcriptional 
activation assays. 

Figure 16 represents a comparison of transactivation and DNA binding abilities of E2 
protein mutants. 

Figure 17 shows results of a transient transcription assay for the E2 protein mutants. A 
15 Mutations with nearly wild-type properties in transient transcription assay. B. Mutations, which 
transcriptional activity has decreased to 50% of that of the wild-type protein. C. Inactive 
mutations for transcription. D. Transcriptional properties of chimeric proteins p53:E2 and 
VPI6:E2. 

Figure 18 shows results of staining for beta-galactosidase activity in brain striatum 
20 sections, where panels A-F represent plasmid DNA dissolved in cerebrospinal fluid (A-D) or PBS 
(E. F). 

Figure 19 is a bar graph showing results of P-galactosidase reporter gene expression in 
mice after injection of a vector of the invention containing a P-galactosidase gene. 

Figure 20 provides the nucleotide and amino acid sequence of the BPVl E2 sequence 
25 (SEQ ID NO: 1). The positions which were mutated to produce E2 point mutants are designated 
as * and are referred to in the text by amino acid residue. The E2 protein sequence begins at the 
MET indicated as 1 . 

Figure 21 provides the nucleotide sequence of the BPV upstream regulatory region (SEQ 
ED NO:2). The MME sequence is located between positions 7475(Cla 1 site) and the Hpa 1 site 
30 (7947). 

Description 
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The invention is based on the recognition that DNA replication in papillomavirus from the 
minimal origin (MO) per se is not sufficient for long-term persistence, but that in addition another 
viral sequence is required. This sequence, termed herein a minichromosomal maintenance element 
5 (MME) comprises a binding site for proteins which are essential for papillomavirus replication. 
Although the MME sequence may include binding sites for replication proteins that are of viral or 
human origin, for example, in BPV, the sequence appears to be dependent on viral proteins El 
and E2. and is specifically bound by the viral E2 protein, when the sequence is linked to the 
minimal origin sequence. 

10 The invention thus is based on the discovery of a cis-acting element, referred to herein as a 

minichromosomal maintenance element (MME), which confers long-term stability to a transiently 
replicating eukaryotic episomal plasmid. An MME is distinct from a minimal origin of replication 
(MO), and is required in addition to an MO for long-term plasmid persistence in a host cell. 

1 5 RECOMBINANT VECTORS OF THE INVENTION 

The invention encompasses a genetic construct comprising a single plasnnid containing the 
MO and MME sequences (MO/MME vectors) and a cloning site for insertion of a gene of 
interest. The invention also encompasses a genetic construct comprising a single vector containing 

20 genes encoding El and E2 plus MO and MME sequences (E1/E2/M0/MME vectors) ^and a 
cloning site for insertion of a gene of interest. In its most useful aspect, the invention features the 
above described plasmids wherein a gene of interest also is contained in the plasmid 

Optionally, vectors of the invention may include multiple cloning site cassettes and 
selectable markers conferring drug resistance to mammalian and bacterial cells, and reporter genes 

25 such as lacZ (Figure 7). These vectors can be used as stable expression vectors in a wide range of 
both dividing and non-dividing (post-mitotic) cell types. 

An important property of a plasmid containing these determinants is that replication of this 
episomal plasmid is not subject to regulation by the cellular controls which regulate host genome 
replication; that is, replication occurs independently and is under El /E2 control in the S-phase of 

30 the cell cycle. (Figure 8). The initiation of El/E2-dependent replication of MO/MME- containing 
plasmids is random in the cell cycle but over- replication does not occur; stable copy number is 
maintained in both Gl and G2. (Figure 9). 
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Vectors of the invention are safe to use in human cells and impart no known oncogenic 
properties to recipient cells. AJl papilloma-encoded oncogenic sequences have been deleted. (Figs. 
2 and 3). 

5 Definitions 

As used herein, "papillomavirus" refers to a member of the papilloma family of viruses, 
including but not limited to bovine papillomavirus (BPV) and human papillomavirus (HPV). 

"Minimal origin of replication" (MO) refers to a minimal cis-sequence within a 
10 papillomavirus that is necessary for initiation of DNA synthesis. The MO of BPV- 1 is located at 
the 3' end of the upstream regulatory region within a 60 |a fork, corresponding to a 52 base pair 
DNA fragment (7928-7947/1-25) including an AT-rich region, a consensus sequence to which all 
papilloma viral E2 proteins bind, and an El protein binding site spanning nucleotide 1 . The MO of 
HPV is located in the URR fragment (for example, in HPVl 1 at nt 7072-793 3/1-99) (Chiang ei 
15 al, PNAS 1992). 

In a transient replication assay, the efficiencies of replication of plasmids bearing a 
Minimal Origin (MO) and a full size Origin of Replication are equal. Only two viral proteins. El 
and E2, are required for stable replication of the full size Origin. An observation which led in part 
to the discovery which forms the basis of the invention is that the minimal origin of replication 
20 (MO) is absolutely essential, but it is not sufficient to stably maintain the plasmids in an episomal 
state; additional elements in the Upstream Regulatory Region are not only required, but are 
sufficient, for stable persistence of the plasmids (Figure 2). 

"El " refers to the protein encoded by nt 849-2663 of BPV subtype 1 ; or to nt 832-2779 of 
HPV of subtype 11, or to equivalent El proteins of other papillomaviruses, or to functional 
25 fragments or mutants of a papillomavirus El protein, i.e., fragments or mutants of El which 
possess the replicating properties of El. 

"E2" refers to the protein encoded by nt 2594-3837 of BPV subtype 1; or to nt 2723-3823 
of HPV subtype 11, or to equivalent E2 proteins of other papillomaviruses, or to ftmctional 
fragments or mutants of a papillomavirus E2 protein, i.e., fragments or mutants of E2 which 
30 possess the replicating properties of E2. Numerous E2 mutants are described herein which are 
defective in the E2 transcriptional activating activity and competent in the replicating ability. 

"Minichromosomal maintenance element" (MME) refers to a region of the papilloma viral 
genome to which viral or human proteins essential for papilloma viral replication bind, which 
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region is essential for stable episomal maintenance of the papilloma viral MO in a host cell. 
Preferably, the MjME is a sequence containing multiple binding sites for the transcriptional 
activator E2, although the MME also may be a sequence containing host cell factor binding sites. 
An MME is most likely to be located in the upstream regulatory region of the viral genome. The 
5 MME in BPV is herein defined as the region of BPV located within the upstream regulatory 
region which includes a minimum of about six sequential E2 binding sites, and which gives 
optimum stable persistence with about ten sequential E2 binding sites, and which may include as 
many as about 20-30 or as many as about 50 sequential E2 binding sites. The sequential binding 
sites which constitute the MME need not be identical in sequence, but must be able to bind £2. In 

10 addition, between each sequential binding site in the MME, there may be spacer nucleotides, for 
example, 6 nucleotides, sufficient to insert a restriction enzyme site. The spacer nucleotides may 
be absent from the MME or may extend to a length of 10, 20 or even 50 nucleotides, so long as 
the binding of E2 to each separate binding site is not disrupted by the presence of the spacer. 

The Minichromosome Maintenance Element (MME) of BPV comprises multiple, binding 

15 sites for the E2 protein (Figure 3). 10-20 tandem repeats of E2 binding sites impart greater 
stability and higher plasmid copy number (approx. 30 copies per cell. Figure 4) in cells expressing 
the BPV (or HPV) El and E2 proteins. (Figure 5). It is believed that MME/MO-bearing plasmids 
function in a wide range of eukaryotic cells including rodent, monkey and human cells, and in 
almost any tissue type. 

20 The MME confers (El+E2)-dependent stable replication upon plasmid transfection mto a 

host cell line expressing both El and E2. It is observed that neither El nor E2 alone is sufficient 
to permit MME-mediated plasmid stability (Figure 6). El and E2 can be provided to the plasmid 
either in cis or in trans (i.e. from integrated El and E2 expression vectors or from the same 
episomal plasmid). 

25 "E2 binding site" refers to the minimum sequence of papillomavirus double-stranded DNA 

to which the E2 protein binds. This binding site is in most papillomaviruses located in the 
upstream regulatory region in BPV and HPV. In BPV, the E2 binding site is a palindromic 12 
nucleotide sequence (ACCN6GGT, where N is any nucleotide) which is repeated approximately 
10 times within the URR. The affinities of the 10 E2 binding sites for E2 varies among the binding 

30 sites in the BPV URR, with site 9 (ACCGN4CGGT, where N is any nucleotide) being the 
strongest E2 binding site. 
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A "host cell" which is stably transformed according to the invention may be any 
prokaryotic or eukaryotic cell, an is preferably a mammalian cell, and most preferably a human 
cell. The cell may be derived from any tissue, for example, muscle, nerve tissue, etc. 

When the El and/or E2 genes are located in cis with respect to the MO and MME, this 
5 refers to a genetic context in which one or both genes are located on the same episomal element 
or the same vector as the MO or MME. In contrast, when the El and/or E2 genes are located in 
trans with respect to the MO and MME, this refers to a genetic context in which the El and/or E2 
genes are not located on the same episomal element as the papilloma MO or MME, such as a 
context in which the Ei and/or E2 genes are integrated into the host cell chromosome or the El 

10 and E2 genes are carried on a separate (non-contiguous) genetic element (vector or episome). 

"Stable maintenance" or "long-term persistence" refers to two characteristics of vectors of 
the invention. First, it refers to the ability of a vector according to the invention to persist or be 
maintained in undividing cells or in progeny cells of dividing cells in the absence of continuous 
selection over the long-term. As used herein, "long-term" refers to a period of time that is longer 

1 5 than at least about 5 weeks, for example longer than 8 weeks (where a given cell doubling time is 
about 16 hours). Of course, for a longer or shorter cell doubling time, the definition of "long- 
term" changes accordingly. Second, it refers to the ability of a vector according to the invention 
to persist without an appreciable loss of copy number from one cell division to the next. In 
determining whether a given vector is capable of long-term persistence, the recombinant vector 

20 may be introduced into the host cell under conditions in which the vector is selected for, when the 
selection conditions are thereafter removed, the copy number of the recombinant vector 
nevertheless remains constant and reliable thereafter; for example, in non-dividing cells, a vector 
of the invention persists for a period that is longer than at least 5 weeks, for example, 6 weeks, 8 
weeks, 12 weeks or longer, for a host cell which doubles in about 16 hours. A vector of the 

25 invention is capable of persisting, in dividing cells, over about at least 50 generations, 60 
generations, 80 cell generations, or 120 cell generations or longer under non-selective conditions 
without an appreciable loss of copy number. We have found that loss of copy number over 80 
generations of cell doublings is less than about 10% (over 80 generations). Therefore, loss of 
copy number from one cell generation to the next is less than about 0,125% or about 0.1%. 

30 Therefore, the vectors of the invention are not subject to an appreciable loss of copy number from 
one cell generation to the next. In contrast, "short-term persistence" of a plasmid in a host cell 
refers to the inability of a plasmid to persist in a host cell long-term, as defined herein, without an 
appreciable loss of plasmid copy number, as defined herein. 
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A "gene of interest" refers to a gene encoding a gene product of interest such as a protein 
of interest or an RNA of interest. A "protein of interest" refers to any therapeutic, prophylactic or 
marker protein useful according to the invention. In addition to any therapeutic protein selected 
for a treatment using a vector of the invention, a therapeutic protein may include a cell or viral 
5 surface antigen to which an immune response may be elicited when used in a vaccine. 

"Heterologous" or "erogenous" gene refers to a coding sequence that is introduced into a 
host cell on a vector of the invention. The coding sequence may be identical to a sequence 
contained within the host cell, or it is most likely not identical to a host cell sequence. 

"Heterologous promoter" refers to a promoter that is not the natural (or homologous) 
10 promoter that initiates transcription from the gene with which it is associated, whether that gene 
be the gene encoding a protein of interest or the genes encoding the El and E2 proteins. A 
'^strong" promoter, with respect to El and E2 expression, refers to a promoter which supports 
overexpression of the El and/or E2 gene to an extent that the vector is mamtained in sufficiently 
high copy number so as to make the host cells unhealthy. An example of a strong promoter is the 
1 5 SR-alpha promoter; other strong promoters will provide about the same level of expression of a 
reporter gene in an assay for quantitating gene expression and thus promoter strength, e.g., a 
betagalactosidase assay. 

As explained hereinabove, a vector useful in the invention will contain papillomavirus MO 
and MME sequences, as defined herein, and a cloning site for insertion of a gene of interest, and 
20 will require the presence of the papillomavirus El and E2 proteins for long-term persistence in a 
host cell The El and E2 proteins are effective in trans in the cell and therefore the genes 
encoding these proteins may be present in trans with respect to the vector, or they may be present 
in cis, i.e., within the vector DNA. In addition to the above-described sequences, a vector useful 
in the invention may contain a heterologous gene encoding a protein of interest and also may 
25 contain sequences for regulation of that gene. 

Described below are experiments in which the BPV MME is localized and characterized. 
This experimental strategy also is useful for localizing and characterizing the HPV MME. 
Following the description of the BPV MME characterization is a description of the components 
of vectors of the invention; i.e., heterologous genes of interest in vectors of the invention, 
30 heterologous regulatory sequences^ e.g., promoters, which may be used to direct El and E2 gene 
expression and to direct heterologous gene expression, E2 mutants useful according to the 
invention, host cells usefial in the invention, and delivery vehicles useful for delivering vectors of 
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the invention to host cells. The Examples provided hereinbelow describe specific embodiments of 
the invention and are meant to illustrate and not to limit the applicability of the invention. 

Localization of BPV MME Sequence 

5 

We have constructed a cell line CH04.15 expressing constitutively the viral proteins El 
and E2, that are required for initiation of viral DNA replication. It has previously been 
demonstrated that the minimal origin of replication and the viral El and E2 proteins are sufficient 
for plasmid replication. It is demonstrated herein that the El and E2 viral proteins are not only 

1 0 necessary, but are sufficient when coupled with the MME for long-term episomal persistence. 

Using the cell line CH04.15 it is shown herein that the BPV-1 minimal origin of 
replication (MO) is absolutely necessary, but is not sufficient for stable extrachromosomal 
replication of viral plasmids. By deletion and insertion analysis, an additional element 
(Minichromosome Maintenance Element - MME) in the Upstream Regulatory Region of BPV- 1 

1 5 has been identified which assures stable replication of the MO containing plasmids. This element 
is composed of multiple binding sites for the transcriptional activator E2. MME appears to 
function in the absence of replication but requires El and E2 proteins for activity. In contrast to 
EBV or EBV oriP-containing plasmids, for example, stably maintained BPV-1 plasmids are not 
subject to once-per-cell cycle replication as determined by density labeling experiments. These 

20 results indicate that papillomavirus episomal replicators replicate independently of the 
chromosomal DNA of their hosts 

Construction of cell lines expressing El and E2 proteins 

25 The El and E2 proteins of BPV-1 are necessary for initiation of DNA replication from the 

viral origin of replication. Expression of these two proteins from heterologous expression vectors 
allows replication of a minimal origin in transient replication assays ( Ustav and Stenlund, 1991, 
Ustav et aL, 1991). However, due to the lack of persistence of the transfected expression 
vectors, replication can not be monitored for more than few days after transfection. 

30 To determine whether additional /ran^-acting factors or c/\y-acting elements are required 

for long term persistence of the viral DNA, continuous expression of these two factors has to be 
assured. We therefore constructed several cell lines constitutively expressing the El and E2 
proteins. Expression of these proteins was directed from integrated constructs for El protein from 
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CMV promoter (cell line CH0212) and for E2 protein from HAP 70 promoter (cell line CH049). 
In the cell line CH04.15 which expresses both EI and E2, the El protein was expressed from 
SRa-promoter and the E2 protein from HSP 70 promoter. Selection of the respective cell lines 
and amplification of the expression units of interest was achieved by utilizing the glutamine 
5 synthetase minigene from the PSVLGS.l plasmid according to the protocol described earlier 
(Bebbington and Hentschel, 1987). Expression of El and E2 was identified by 
immunoprecipitation using specific rabbit polyclonal sera (data not shown) and by in vivo 
replication assays. The three cell lines and the parental CHO cells were transfected with the BPV- 
1 origin containing plasmid pUC/Alu in combination with El and E2 expression vectors. The cell 
10 line CH04.15 which expresses both El and E2, supports replication of the origin plasmids in the 
absence of erogenous El and E2. The E2 expressing cell line, CH049, supports replication in the 
presence of an El expression vector, but fails to do so without erogenous El. The El expressing 
cell line, CH0212, supports replication only in the presence of an E2 expression vector. In the 
parental CHO cell line, co-expression of both El and E2 is required for replication. No 
1 5 replication of pUC/Alu can be detected in the absence of El and E2. 

Described below are experiments which demonstrate the trans and cis-acting elements 
which are not only necessary but are sufficient for long-term episomai persistence. The El and 
E2 proteins are both necessary and sufficient for stable replication of the BPV- I origin containing 
plasmids. In addition, the BPV- I URR contains sequences which are not only required,, but are 
20 sufficient for long-term replication of the papilloma-derived episome. 

This experimental strategy and procedure also may be used to determine the exact location 
in HPV of the MME, simply by substituting HPV for BPV, and testing regions of the HPV 
genome. Such testing would begin with the URR of HPV, which region includes binding sites for 
E2 and for cellular factors involved in replication. 
25 The experimental protocol used for defining the BPV MME was as follows. 

To determine what viral /ra/75-acting factors and c/j-sequences were required for long 
term replication of BPV- 1, we used the CH04.15 cell line which constitutively expresses both the 
viral El and E2 proteins. Different fragments of BPV-1 were cloned into the vector pNeo5' 
(Lusky and Botchan, 1984 ). This plasmid provides amino glycoside 3 '-phosphotransferase as a 
30 marker for selection of the cells in the presence of geneticine (G4I8). We used a 2.5 kb Bglll 
fragment from the BPV-1 genome (nucleotides 6946-1515) as a starting fragment. This fragment 
contains the URR including the E2 dependent transcriptional enhancer, the minimal origin of 
replication and part of the early ORF's (construct 12, Figure 2A). This plasmid, in parallel with a 
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minimal origin fragment in the same backbone (construct 13, Figure 2 A), were transfected into 
the CH04. 15 cell line by electroporation and processed according to the scheme in Figure 1 A. 

In Fig 1 A, both transient and long-term plasmid persistence was determined. A portion of 
the transfected cells were used after plating for analysis of transient replication. The remaining 
5 portion of the cells were selected in the presence of G418 for three weeks, colonies were then 
pooled or picked and grown under nonselective conditions for two additional weeks, to give a 
total of five weeks, at which time low molecular weight DNA was harvested and analyzed for the 
presence of replicated plasmid. The ability of the origin-containing plasmids to replicate extra 
chromosomally in transient and long term replication assays was examined by Southern analysis of 

10 the episomal DNA (see Materials and Methods for details). The two plasmids containing the 2.5 
kb Bglll fragment and the minimal origin respectively, replicated to comparable levels in the 
transient replication assay ( Figure IB). After selection in the long term replication assay, 
however, the result was very different. While the plasmid containing the 2.5 kb Bglll fragment 
could be readily detected in episomal form, the minimal origin containing plasmid could not be 

15 recovered (compare lanes 12 and 13, Figure 2B). These results showed that sequences present in 
the larger plasmid, but absent in the plasmid containing only the minimal origin were required for 
long term replication, and that an activity in addition to replication was required for stable 
persistence of a plasmid in the CH04.15 cells. These results also indicated that it was possible to 
maintain BPV origin containing plasmids for an extended period of time (e.g., five weeks) without 

20 the regulatory circuit that the viral genome can provide. 

To determine what sequences within the Bglll fragment were responsible for maintenance 
in the long term assay, we generated mutations within that fragment and assayed these plasmids 
for maintenance. Initially, we generated mutations in the sequences suggested previously to have 
effects on replication. The BPV- 1 Bglll fragment contains coding sequences for three potential 

25 proteins, E6, E7 and the N-terminal part of the El protein. Mutations which interrupted El ORF, 
E6 ORF and E6/7 ORF (Lusky and Botchan, 1985; Schiller e( ai, 1984; Berg et al., 1986) - 
construct 9 (pNeo Sma), construct 10 (pNeo775), construct 11 (pNeo576), respectively and in 
addition a deletion removing all coding sequences - construct 7 (pNeoBgl40), were introduced 
into the pNeo5' (Figure 2A). None of these mutations had a detectable effect on maintenance 

30 (compare lane 12 with lanes 7, 9, 10 and 11 Figure 2B) indicating that the coding sequences 
contained within the Bglll fragment were dispensable. Consequently the El and E2 are the only 
viral gene products required for maintenance. 
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It has been suggested previously that BPV-1 URR contains two partially overlapping cis - 
regulatory control elements for stable replication, termed Plasmid Maintenance Sequence (PMS- 
1) (Lusky and Botchan, 1984) and Negative Control of Replication (NCOR-1) (Roberts and 
Weintraub 1986). In order to demonstrate that these two sequences were not necessary or 
5 sufficient for long-term plasmid persistence, each sequence was deleted by removing the sequence 
between the Hindlll and MIuI sites (nt. 6959-7351, construct 8 - pNeoANCOR, Figure 2 A). This 
deletion had no deleterious effect on long term replication of the plasmid (lane 8, Figure 28), 
demonstrating that these putative elements were not required. Finally, as a negative control, an 
Xhol linker insertion mutant overlapping with Hpal site was generated (construct 6, Figure 2A). 
10 This mutation generated an origin defective for replication in the transient replication assay, and it 
is also defective for long term replication (lane 6, Figure 2B). We concluded from these results 
that ds-elements required for stable replication of BPV-1 are located within the URR and are 
unrelated to the previously proposed elements PMS-1 and NCOR-1. We have named this ds- 
element in the BPV-1 URR Minichromosome Maintenance Element (MME). 

15 

MINICHROMOSOMAL MAINTENANCE ELEMENTS OF THE INVENTION :> 

A minichromosome maintenance element is localized and defined as follows. 

20 The Minichromosome Maintenance Element is composed of redundant sequences. 

To define the sequences required for long term replication we generated a series of 
deletion mutants within the URR (Figure 3 A). These deletions were made in the context of the 
plasmid pNeoBgWO (BPV-1 sequence from 6946 to 63), and the deletion mutants were tested in 

25 the long term replication assay. The first series of constructs (1 -6, Figure 3 A) had a fixed 5 -end 
(nucleotide 7187) and progressive deletions were made at the 3'-end of the URR (nucleotides 
7892, 7834, 7771, 7475, 7389, respectively). The first four of these deletions (constructs 1-4, 
Figure 3 A) were defective for stable replication (Figure 3B), but the plasmids with less extensive 
deletions (plasmids 5 and 6, Figure 3 A) were maintained at the wild type level (lane 5 and 6, 

30 Figure 3B). Another set of deletions (constructs 7-11, Figure 3 A) had a fixed 3'-end (nucleotide 
7890) and progressive deletions from the 5'- end (7476, 7611, 7673, 7771, 7834, respectively). 
Two mutants of this series were unable to replicate stably (deletions 7 and 8, Figure 3 A and 3B), 
but plasmids with less extensive deletions replicated efficiently (lanes 9-11, Figure 3 A and 3B). 
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The results from the unidirectional deletions showed that a sequence in the vicinity of nucleotide 
7600 was required for long term replication (compare constructs 4 and 5, and 8 and 9). To map 
this sequence more precisely we generated a third set of deletions. These deletions were 
constructed as scanning deletions (constructs, lanes 11-21, Figure 3A, 3B). Surprisingly, none 
5 of the introduced mutations resulted in loss of MME activity demonstrating that no single unique 
sequence within the URR was required, but that maybe some redundant sequence element was 
responsible. 

The MME is composed of Binding sites for the E2 transcription activator. 

To address directly if the sequences in the vicinity of nucleotides 7600 were responsible 
for the MME activity we generated a fragment between nucleotides 7590 and 7673, and inserted 
this fragment into the deletion mutant D22 1/234 (construct 1 in Figure 3 A) to determine if 
replication in the long term replication assay could be restored (this fragment corresponds to the 

15 sequence between the deletion end-points D 134 and Dll). This fragment insened in three and 
six copies restored MME activity in the long term replication assay (Figure 5, compare lanes 1 
with 2,3). A known constituent of this fragment are three high affinity E2 binding sites. A 
possibility that occurred to us was that the MME activity was contributed by E2 binding sites. 
This would be consistent with the apparent redundancy, since the URR contains 1 0 binding sites 

20 for E2. To determine if E2 binding sites were involved in MME activity we oligomerized a high 
affinity E2 binding site 9 of the BPV-1 URR (5'-ACCGTTGCCGGT-3') with six nucleotide 
spacing (Li et aL, 1989) and inserted these oligomers (10 copies) into D221/234 deletion mutant. 
This insertion restored the MME activity (lane 4, Figure 5). To rule out involvement of other 
BPV sequences we added 10 oligomerized E2 binding sites to the minimal origin of replication. 

25 Those constructs replicated with similar efficiency as plasmids with wild type BPV-1 sequences in 
the stable assay (lane 5, Figure 5). However, plasmids v/ith less than six additional oligomerized 
E2 binding sites failed to replicate in the long term replication assay (data not shown). These 
resuhs strongly suggest that binding sites for the E2 protein can be responsible for providing 
MME activity to the BPV-1 origin. 
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The MME enhances the frequency of formation of G418-resistant colonies without 
replication. 

It has previously been observed that for EB V, multimerized EBNA- 1 binding sites (Fannily 
5 of Repeats - FR) in an EBNA-1 dependent fashion are required for stable replication of oriP 
containing plasmids (Krysan et aL, 1989; Kirchmaier and Sugden, 1995; Middleton and Sugden, 
1994). This activity can be measured by increased transformation frequency of the plasmids 
carrying FR, and is thought to be caused by enhanced nuclear retention of plasmids containing 
FR. To determine if a similar activity could be determined for MME we measured the 

10 transformation frequency of four different plasmids. First, pNeo5' carries the selectable neomycin 
resistance marker, but lacks BPV sequences and consequently is defective for replication in both 
the short term and long term replication assays. Second, the minimal origin plasmid, in addition 
carries the BPV minimal origin and is replication competent in the short term replication assay but 
not in the long term assay. The third plasmid pNeoHpal^XhoI carries the whole Bgl40 

1 5 fi-agment, and is thus nominally capable of maintenance, but because of the linker insertion in the 
El binding site the plasmid is defective for replication. The fourth plasmid pNeoBglAO is 
replication competent in both the short and long term replication assays. We used CH04.15 cells 
to measure transformation frequency of these different plasmids (Figure 6). The vector with the 
selectable marker only and the plasmid containing the minimal origin transformed CH04.15 cells 

20 with similar firequency, 14 and 16 colonies per 10^ cells, respectively. In the parallel experiment, 
pNeoHpaI->XhoI which is replication defective with a mutant El binding site, but carrying the 
sequences required for MME activity, transformed CH04.15 cells 4 to 5 times more efficiently 
than vector alone or a plasmid with minimal origin of rephcation (68.6 colonies per 10* cells) 
(Figure 6A). The plasmids with the complete origin transformed cells with approximately 100 

25 times higher efficiency than the vector containing only the minimal origin of replication 
(approximately 1600 colonies per 10^ cells). These results indicated that MME activity could be 
measured in a stable transformation assay even in the absence of replication. When the same 
experiments were also performed in CH0212 (El cell line) and CH049 ( E2 cell line), all 
plasmids transformed with approximately the same efficiency in these two cell lines (Figures 6B 

30 and 6C). We conclude that enhanced transformation activity requires both El and E2 proteins. 

We measured the possible effect of an MME consisting of oligomerized E2 binding sites 
on plasmid retention in the short term assay, as it has been done with the EBV oriP containing 
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plasmids. However, attempts to reproduce a direct effect on nuclear retention in transient assay 
failed to show a significant effect (data not shown). 

BPV origin plasmids replicate approximately at 15 copies per haploid genome. 

5 

One of the factors that is expected to influence the stable persistence of a plasmid is the 
copynumber. We therefore performed experiments to estimate the average number of episomal 
copies per haploid genome in different established cell lines. After digestion with the single-cut 
restriction endonuclease (Hind III) total DNA from three independent cell lines • pNeoBgl 40, 

10 pNeo41/Cla and pNeoSca/234 was loaded in equal amounts onto the gel and was analyzed by 
Southern blotting using radioactively labeled BPV-1 origin and neo probe. All three cell lines 
contained approximately the same number of episomal plasmids - 15 copies per haploid genome 
(Figure 4A). Even though no specific effort was made to determine the number of integrated 
copies, digestion with a non-cutter enzymes did not change the appearance of the three forms and 

15 oligomers of the plasmid (Figure 4B). Consistent with previous reports, the majority of the 
plasmids were present in the oligomeric form. We conclude from these results that the plasmids 
are mostly episomal in the CH04. 15 cell line under the conditions used. 

Mode of replication 

20 

One explanation for the apparent high stability of BPV- 1 plasmids in the cells could be 
that the plasmids are subject to the cellular once per cell cycle replication control. To determine if 
this was the case we performed density labeling experiments using the cell line CH04.15 
containing the replicating plasmid pNeoBgl40. The experiments were performed by continuous 

25 labeling of the cells with BrdU for 3.5, 9.5, 15 or 24 hours. Low molecular weight DNA and 
total chromosomal DNA were extracted after each time point and analyzed by CsCl gradient 
centrifiigation, followed by slot blotting, and hybridization with plasmid probe or genomic DNA 
probe to identify the peaks in the gradient. The density gradient profiles are shown in Figure 8. 
The data is sunrmiarized in the table as fractions of Bgl40 DNA and CHO chromosomal DNA that 

30 had incorporated no BrdU, (light-light- LL), BrdU incorporated into one strand (heavy-light - 
HL), or into both strands (heavy-heavy - HH) in the CH04,15 cells stably transformed by this 
plasmid (Figure 8). After labeling with BrdU for 3.5 hours (panels a) and e)) the episomal BPV-1 
origin containing plasmids were divided between three forms of DNA: 5% heavy-heavy, 19% 
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heavy-light and 76% light-light, while chromosomal DNA is distributed between two forms 27% 
heavy-light and 73% light-light. After labeling for 9.5 hours, the plasmid has accumulated 
considerable amount (21%) of heavy-heavy DNA, while chromosomal DNA shows no detectable 
signal in the heavy-heavy area. After labeling for 1 5 hours, distribution of the episomal DNA is 
5 34% heavy-heavy, 38% heavy-light, 28% light-light. At the same time chromosomal DNA 
showed still very little, if any, heavy-heavy DNA. After labeling for 24 hours, episomal DNA was 
preferentially in the heavy-heavy ft-action of the DNA (66%), 24% heavy-light and 10% light- 
light, while chromosomal DNA showed considerable amount of heavy-heavy DNA (24%), but 
with most of the DNA still in the once replicated DNA fraction. These results are consistent with 

10 a doubling time of approximately 16 hours for the pNeoBgl40 containing cell lines. The 
considerable percentage of the unreplicated chromosomal DNA after 24 hours is likely to be due 
to growth arrest of a fraction of the cells by the conditions used for BrdU labeling. It appears 
clear from these results that the stably maintained pNeoBgl40 plasmid does not replicate once per 
cell cycle and stable persistence of the BPV-l plasmids is not a ftinction of once per cellcycle 

1 5 replication control. 

Mechanism of Action 

Three mechanisms of action can be envisioned for the MME. First, the MME could affect 
20 the efficiency of initiation of replication. Although no difference in replication initiation can be 
detected during the time-course of the transient replication assay (Ustav ei aL, 1991), it is 
conceivable that a gradual accumulation of methylated residues at the origin of replication or 
some other form of modification (including nucleosome occlusion) could prevent initiation of 
replication and results in gradual loss of replication activity. It is possible that MME can affect 
25 these processes and prevent inactivation of the origin. Alternatively, the minimal origin 
containing plasmids are competent for over replication during the S-phase of the cell cycle, which 
could be toxic to cells. MME could prevent this process, in analogy to the function of iterons for 
certain bacteriar plasmids (for review Nordstrom, 1991). Third, MME could influence the 
partitioning process thereby affecting the loss rate of plasmids during cell division. 
30 It is interesting to note that the ability of the minimal origin containing plasmids to 

replicate appears to have no detectable effect on transformation frequency. One might have 
expected that an increase in the quantity of plasmid DNA in the cells as a result of replication 
would lead to a higher frequency of integration. However, this appears not to be the case, 
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possibly because these minichromosomes are poor substrates for the required recombination 
events or are lost with very high frequency at cell division. The very large increase in 
transformation frequency of the plasmids with both MO and MME compared to integrating 
marker presumably reflects the fact that the two functions together can bypass the requirement for 
5 integration possibly by providing an efficient segregation/partitioning function in addition to 
replication. 

Materials and Methods 

10 The following methods are routinely used in the invention. These methods are stated in 

terms of the detailed experiments performed herein. However, each method may be generalized 
by one of skill in the art for use in carrying out the invention in its broadest sense, as described 
below and claimed. 

15 Plasmid construction, (i) Expression vectors. The El and E2 protein expression vectors 

pHSE2 (S2ymanski and Stenlund, 1991), pCGE2 and pCGEag (Ustav and Stenlund, 1991) have 
been described earlier. The El expression vector pEl - 1x5 contains the BPV-1 El ORF with 
Xhol linkers (within nucleotides 619 to 2757) and carries a point mutation at the splice donor site 
at nucleotide 1235. This fragment was cloned into the Xhol site downstream of the SRa - 

20 promoter in the plasmid pBJ5GS (kind gift from Dr. L.Berg). 

(u^ Origin plasmids. All origin fragments of the BPV-1 genome were cloned in sense 
orientation into the BamHI site of the pNeo5*(Lusky and Botchan, 1984). pNeoBglll contains a 
Bglll fragment from BPV-1 ( nt. 6946 to 1515) cloned in the sense orientation relative to the 
transcription of the neo gene. pNeoXhoI— >HpaI contains the same Bglll fragment with an Xhol 

25 linker insertion into the Hpal site ( Ustav et ai, 1991). pNeo 576, pNeo775 and pNeoSma* 
plasmids contain the same Bglll fragment with the mutations 576, 775 and Sma- which affect 
E6/7 ORF, E6 ORF and the 5'-part of the El ORF, respectively (Lusky and Botchan, 1985; 
Schiller et ai, 1984; Berg et ai, 1986). pNeoANCOR has a deletion between Hindlll (nt. 6958) 
and Mlul (nt.7351). pNeoBgl40 contained a BPV-1 fragment from nucleotide 6946 to 63, which 

30 was amplified by PGR using respective primers and cloned in sense orientation into the BamHI 
site. pNeoMO contained minimal origin sequence (nucleotides 7914 to 27) cloned into the BamHI 
site. Linker deletion mutants of the BPV-1 genome (Szymanski and Stenlund, 1991) were used 
as templates for PGR. Primers - 5'-AAAAGCTTTCTTTGGACTTAGA-3' (BPV-1 nucleotides 
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6959-6979) and 5'-ATAGCCAGCTAACTATAGATCT-3' (BPV-1 nucleotides 45 to 63 flanked 
by Bglll site) were used to amplify origin fragments. PCR products were cloned into the Hindlll 
and BamHI site of the pNeoBgl40. Deletion mutants lacked following sequence: D221/234 - 
7187/7892; D36/234 7187/7834; D121/234 - 7187/7771; D134/234 - 7187/7673; DCla/234 - 
5 7187/7475; DSca/234 7187/7389; D221/Cla - 7476/7892; D221/11 - 7611/7892; D221/134 - 
7673/7892; D221/121 7771/7892; D221/36 - 7834/7892; D36/121 - 7771/7834; D121/134 - 
7673/7771; D134/43 - 76227673; D43/1 1 - 7611/7622; Dll/229 - 7597/7611; D229/Cla - 
7476/7597; DCla/41 - 7355/7476; D41/136 - 7344/7356; D136/Nar - 7273/7344; DNar/64 - 
7214/7273 and junctions contained 8-mer BamHI Tinkers; D22 1/234+ 134/1 1x3 contains an 

10 insertion of the fragment 7590-7673 in three copies and D221/234+134/1 1x6 in six copies. 
D22 1/234+ 10BS9 has an insertion of the E2 protein binding site 9 in ten copies. 
DHindIIl/221 + 10BS9 is a deletion between nucleotides 6959/7892 which carries 10 copies of 
oligomerized E2 binding site 9. All deletion and insertion mutants were verified by sequencing 

Construction of cell lines. CHO-Kl (Chinese Hamster Ovary - ATCC CCL 61) was used 

15 as the parental cell line to express BPV-1 replication proteins 

(\) E2 cell line CH049. The E2 expression vector pHSE2 was linearized with Xhol and 
the plasmid pBJ5GS carrying glutamine synthetase minigene expression unit (Bebbington and 
Hentschel 1987). pBJ5GS (kind gift of Dr. L.Berg) was linearized with Sail endonuclease. The 
plasmids were mixed at a 1 : 1 ratio and ligated into the concatemers at high DN A concentration 

20 (300|j.g/m]) overnight at 16*^C using T4 DNA ligase. 

Ten micrograms of the ligated DNA was mixed with 50jig carrier DNA and was 
electroporated into the 7x10^ CHO-Kl cells using Ham's F12 medium supplemented with 10% 
fetal bovine serum at 220V, using a BioRad electroporation apparatus at the capacitance setting 
960nP. Selection for glutamine synthetase was done at 25|iM concentration of the L-methionine 

25 sulfoximine (Sigma) in glutamine free Glasgow Minimal Medium supplemented with dialyzed fetal 
bovine serum, non-essential amino acids, glutamic acid, aspartic acid, sodium pyruvate, 
nucleosides, penicillin and streptomycin, essentially as has been described by Bebbington and 
Hentschel (Bebbington and Hentschel, 1987). Colonies were picked ten days after the selection, 
expanded and used for the second round of selection at 250 (xM of L-methionine sulfoximine. 

30 This step was included to amplify the sequences coupled to the selection marker. Cell lines were 
expanded and tested for expression of E2 protein by immunoprecipitation with polyclonal rabbit 
antibodies against E2 protein (Ustav and Stenlund, 1991) after labeling with ^^S L-methionine 
using Translabel ( ICN) and by functional transient replication assay as described below. 
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Cn) El cell line CH0212. El protein expression vector pCGEag (Ustav and Steniund 
1991) was linearized by the Xhol and pBJ5 GS was linearized, mixed at a 1:1 ratio and ligated 
into the concatemers and the cell line expressing El protein was generated essentially same way 
like E2 expressing cell line. 
5 fiii) El and E2 expressing cell line CH04.15. E! protein expression vector pE 1-1x5 

containing glutamine synthetase minigene and El coding sequence was linearized with Sail and 
pHSE2 was linearized with the Xhol restriction endonuclease. Linear plasmids were ligated into 
the concatemers at a ratio 1:1, CHO-Kl cell were transfected by electroporation and selected as 
described above. 

10 Transient replication assays were done as described earlier (Ustav and Steniund, 1991) 

using the respective cell lines. For testing the cell lines in functional assay for expression of the 
BPV-1 replication proteins we used 50 ng of pUC/Alu (Ustav e( ai, 1991) as origin containing 
plasmid and 250 ng of the El expression vector pEl-lx5 or pCGE2 to complement E2 cell line 
CH049 or El cell line CH0212, respectively. All pNeoS* based origin plasmids were tested for 

15 their ability to replicate in the CH04.15 cell line by transfecting 100 ng of the plasmid DNA 
together with 50ptg of denatured carrier salmon sperm DNA into the CH04.15 cells at 240V by 
electroporation. Extrachromosomal DNA was extracted from the cells at 48 and 72 hours post- 
transfection by alkaline lysis as described earlier (Ustav and Steniund, 199 1). DNA was purified, 
digested with Dpnl and linearizing enzyme and were analyzed by Southern analysis. Specific 

20 probes for hybridization were made by random priming. 

Stable replication. CH04.15 cells were electroporated with lOOng of origin containing 
plasmid DNA in the presence of SO^ig of carrier DNA. Ninety six hours after transfection 
CH04.15 cells were trypsinized and subjected for the antibiotic G418 selection at the 
concentration 450 |i/ml. Colonies were pooled or single colonies were picked after two weeks, 

25 expanded and episomal or total DNA were analyzed by Southern blotting. 

Copy-number measurement. Total DNA from established cell lines, containing replicating 
BPV-1 origin plasmids, was extracted and digested either with a plasmid single-cutter (Hindlll) 
or a plasmid non-cutter (Apal), followed by electrophoresis in 0.7% agarose/TAE gels. The 
copynumber was measured by Southern blotting, using probe containing sequences from the BPV 

30 on and neo gene. Results were quantitated by comparing band intensity to a two-fold dilution 
series of plasmid DNA using phosphoimager. 

BrdU labeling and analysis of the replication mode. Cells were labeled with 35 p-g/nJ 
BrdU in MEM medium, containing 2*-deoxycytidine (20 |ig/ml) using procedures described earlier 
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(Yates and Guan, 1991). Episomal and chromosomal DNA was extracted at time points indicated 
in Results section. CsCl solution was added to DNA preparation to 1.74 g/ml and centrifuged for 
4& hours at 37000 rpm. 24 fractions were collected from each gradient, subjected to denaturation 
and neutralization, and slot-blotted onto nylon fillers. Filters were hybridized with labeled BPV-1 
5 or CHO genomic DNA probes. Radioactivity was counted with Phosphoimager, Fuji. 

Nuclear retention measurement bv stable transformation assay. 48 hours following 
transfection the cells were trypsinized, counted and plated at 3 different dilutions. G418 selection 
was applied, and the selective medium was changed every 3 days. Colonies were counted after 1 0 
days. 

10 

TREATMENT OF DISEASES ACCORDING TO THE INVENTION 

The invention is useful in vivo and ex vivo human gene therapy where correction of 
inherited or acquired genetic defects is desired, and is therefore useful to treat any disease where 

15 gene delivery provides benefit, whether the gene is delivered to a terminally differentiated host 
cell, such as a hepatocyte, or to an undifferentiated cell such as a stem cell. The invention is 
useful in treatment of chronic or acute diseases, e.g., T-cell diseases, inflammation, fibroses of the 
liver, and arthritis. The invention also is useful in vaccination protocols where resistance or 
immunity to infectious pathogens, such as HIV, Hepatitis C Virus, Hepatitis B virus, is desired, or 

20 the elimination or induced quiescence of aberrant cells, such as cancer cells, is considered 
beneficial. 

Recombinant vectors of the invention are useful in that they permit persistent expression 
of a therapeutic gene in both dividing and non-dividing cells; for example, in differentiated cells, 
such as those in gut, brain, and muscle. 

25 Recombinant vectors of the invention are also useful for high level transient expression in 

cells where desired, such as for cancer therapy or in vivo vaccination. 

Both in vivo and ex vivo gene therapy strategies are possible with this vector system, 
including stable, multicopy gene maintenance and expression, in haemopoietic and other stem 
cells, and in the committed and differentiated progeny of these cell types. 

30 Nucleic acid vectors suitable for use in the present invention include circular and linear 

lengths of DNA. 

For human gene therapy, uses of the recombinant vectors of the invention are not limited 
in terms of delivery of the vector to a cell. That is, vectors of the invention may be delivered to a 
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cell via non-viral or viral delivery systems. Delivery systems of non-viral origin include those 
which employ molecular conjugates, cationic liposomes, or synthetic peptides, where vector size 
constraints do not limit the nature and number of plasmid vector components. Delivery systems 
of viral origin include viral particle-producing packaging cell lines as transfection recipients for the 
5 above El/E2/M0/MME-containing plasmids into which viral packaging signals have been 
engineered, such as those of adenovirus, herpes viruses and papovaviruses. 

Recombinant vectors of the invention also are useful in transgenesis, including production 
of transgenic animals via pronuclear injection, or embryonic stem cell transfection and embryo 
chimera generation. 

10 

Heterologous Genes Useful According to the Invention 

Heterologous genes useful in vectors of the invention may also encode antigenic 
determinants of viruses, so as to be useful as vaccines, such antigenic determinants as are present 
15 in coat proteins of flu viruses, malaria, TB, and HIV. For modulation of physiologic activity in 
brain cells, genes including the following may be of interest: CCKA, CCKB, CREB, TH, NT3, 
NT4, BDNF, GDNF, and NGF. 

Regulatory Sequences Useful According to the Invention 

20 

As described below, regulatory sequences which may be carried in vectors of the invention 
will of course include heterologous promoter and enhancer sequences which control expression of 
the heterologous gene of interest, and which may be confer a tight level of regulation upon the 
heterologous gene, including inhibition or activation of gene expression, promoter/enhancer 

25 strength, tissue specificity, or relatively little regulation beyond the initiation of transcription. 

Vectors useful according to the invention may advantageously include a gene encoding a 
therapeutic agent, a promoter which directs expression of the gene, and optionally a cellular- 
derived gene regulatory element which confers tissue specific gene expression. The 
promoter/gene combination may be subject to any one of numerous forms of gene regulation 

30 known in the art, for example, production of the gene product may be subject to continuous 
inhibition by associated factors and thus may require the presence of an activator; alternatively, 
the gene product may be continuously expressed, and only inhibited under certain conditions. 
Gene expression may be regulated at either the transcriptional or translational level. Where such 
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regulation is transcriptional, it may be at the level of the promoter or at the level of RNA 
elongation or processing. Therefore, vectors useful according to the invention may include a 
heterologous promoter/gene combination that is tumed-on and tumed-ofF in trans by the presence 
or absence of a regulatory factor. 

5 

Heterologous Regulatory Sequences for El and E2 Gene Expression 

In addition to heterologous sequences to regulate a heterologous gene of interest which is 
carried on the episomal vector, heterologous regulatory sequences are also used in the invention 

10 to control expression of the papilloma El and E2 genes. Proper control of expression of El and 
E2 is critical for determining plasmid copy number, stability and segregation, and therefore the 
invention encompasses maintenance of an MO+MME-containing plasmid in eukaryotic cells. The 
presence of heterologous regulatory elements has been found to influence persistent expression, 
expression in different cell types, and expression in vivo . 

15 Choice of a heterologous promoter to drive El gene expression has been found to confer 

certain advantages to a vector of the invention, depending upon the intended use of the construct. 
For example, it has been found that a strong promoter such as SRalpha (a hybrid of SV40 early 
region and HTLV-1 LTR) to drive expression of the El gene can be used advantageously 
according to the invention to drive plasmid copy number very high, that is, to levels which render 

20 the host cell unhealthy and prevent normal cell division. This generates unstable, onion skin-like 
products and leads to cell inviability ( Figure 10). However, a strong promoter is advantageously 
used where high level transient expression of a vector heterologous gene is desired, for example, 
in treatment of a cancerous condition where it is desirable to produce a high level of heterologous 
gene product (for example, a toxin) and it is also desirable to kill the host cell (for example, tumor 

25 cells). This type of vector of the invention takes advantage of the ability of the MO/MME vector 
system to replicate at very high levels independently of the cell cycle. 

In contrast, it has also been found that use of a weaker promoter, such as the thymidine 
kinase promoter, to drive El gene expression can be used advantageously in the invention to 
permit sufficient El protein expression to regulate stable (MO+MME)-containing plasmid 

30 replication at high efficiency.(Figure 10). Similarly, use of the weaker promoter such as 
thymidine kinase promoter permits El protein synthesis to a level compatible with efficient 
expression of a reporter gene product from the constructs in transient assay (Figure 11). The 
results shown in Figure 12 demonstrate stable expression of beta-gal in the cells upon selection. 
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SRalpha-El constructs express reporter, however cells are not dividing while Tk-El constructs 
work for expression in transient as well as in stable cells; cells divide normally to give colonies 
which stain blue for beta-gal protein. 

Proper control of post-transcriptional processing of El and E2 transcripts further 
5 influences the replicative and expression properties of the plasmid system (efficiency, 
compatibility with host replication system, stability, toxicity), and is improved by incorporation of 
sequences such as those of human beta globin gene (exxonH, intronll, exonlll sequences), and 
SV40 sequences which incorporate splice, polyA and mRNA stability sequences. Improved 
expression vectors harboring such elements in the El and E2 expression domains are described 
10 herein. Figs. 13 A and 13B show G418 selection of different vectors (SRalpha-El and TkEl 
constructs). 

In addition to the use of regulatory sequences for regulation of El and E2 gene 
expression, the invention also encompasses the use of regulatory sequences for control of a 
heterologous gene encoding a protein of interest, which gene is carried on an episomal vector of 
15 the invention. 

The adenovirus El A promoter and enhancer, the human CMV-MIE promoter and 
enhancer, retroviral LTR elements, herpesvirus promoters, and poxvirus promoters are 
representative examples of heterologous regulatory elements useful in the invention. 

Modification of a heterologous promoter useful according to the invention may be 

20 accomplished according to a number of strategies. For example, the use of negative regulatory 
elements which decrease the level of transcription is envisaged. Especially preferred, however, is 
the modification of the promoter sequences in order to reduce the basal levels of transcription. 
Promoter sequences may be modified in order to remove regulatory elements which respond to 
cell-specific regulatory factors. Preferably, therefore, elements responsible for activation by cell- 

25 specific factors may be mutated or deleted. 

Modified Forms of a Promoter Useful in the Invention 

The invention encompasses the use of modified forms of a heterologous promoter, which 
30 exhibit decreased levels of basal transcription. Such heterologous promoters may be useful in the 
invention, because it may be desirable to confer upon the host cell a low enough level of 
transcription of, for example, an anti-viral gene or a gene encoding a toxin so as to prevent 
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deleterious effects on the host cell by virtue of the presence of the encoded gene product in the 
celL 

Modified forms of a given promoter may be made, as is well-known in the art, using 
conventional PCR and incorporating random or directed base substitutions, deletions, or 
5 insertions in the native promoter sequence. 

Transactivatable viral promoters useful according to the invention include but are not 
limited to the following: Herpes Virus (HSV-1), immediate early promoter (Liv et ai, 1990, 
Biotechniques 9:168; Rice et ai, 1990, Jour. Virol. 64:1704; Howard et ai, 1993, Exp. Cell. 
Res. 207:194); Visna Virus promoters (Carroth et ai, 1994, Jour. Virol. 68:6137); 

10 Papillomavirus promoters (Storey et aL, 1990, Jour. Ge. Virol. 71-965 and Ibarakj et a/., 1993, 
Virus Genes 7:187; Epstein-Barr Virus promoters (TPI and Barn HIC promoters) Cohen ei al, 
1991, Jour. Virol. 65-5880; Sung et aL, 1991 Jour. Virol. 65:2164; and Meitinger et al., 1994, 
Jour. Virol. 68:7497), CMV promoters (IE2, lEl, and MIE promoters) (Malone et a/., 1990, 
Jour. Virol. 64:1498; Cockett et ai, 199 1, Nucl. Ac. Res. 19:319, and Pizzomo et aL,. 1991, 

15 Jour Virol. 65:3839); Hepatitis B promoters (Seto et a/., 1990, Nature 344:72; Raney et a/., 
1991, Jour. Virol. 65:5774; Lauer et aL, 1994, Hepatology 19:23); Spumaretroviral promoters 
(Rethwillm eta/., 1990, Virol. 175:568, Venkatesn etaL, 1993, Jour. Virol. 67:3868); HTLV-l 
promoters (Franklin et al., 1993, Jour. Biol. Chem. 268:21225, Kadison et ai, 1990,. Jour. 
Virol. 64:2141); and Adenoviral promoters (E2)(Oberte/ a/., 1994, Mol. All. Biol. 14:1333). 

20 Heterologous promoters useful according to the invention, including modified forms of a 

given promoter, are first tested for basal activity using a reporter gene, and are also tested to 
determine if they possess or retain the ability to be trans-activated. Where a vector of the 
invention includes heterologous promoter/gene combination which is present in the cell in an 
inhibited state and which requires transactivation for production of the encoded heterologous 

25 gene product, such promoter/gene combinations may be most useful provided they possess or 
retain the ability to be transactivated while exhibiting a low enough level of basal activity such that 
the gene product is virtually undetectable. 

A vector of the invention, whether its heterologous promoter is modified or unmodified, 
may be tested for its ability to be trans-activated to the extent that differential killing of 

30 transformed cells occurs. Functional tests of diniinished basal activity from vectors of the 
invention will include the use of prodrug activating systems, such as ganciclovir, in cells 
containing a vector in which the heterologous promoter/gene combination is in the inhibited 
versus the activated state, 
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Regulatory Elements conferring Cell-Type Specific Heterologous Gene Expression 

Cellular-derived genetic elements also may be included in episomal vectors of the 
5 invention. 

These genetic elements thus are responsive to, i.e., subject to control by, cellular factors. Tissue- 
specific promoters may confer tissue specificity. 

E2 Mutants Defective for Transcriptional Activation 

10 

The invention also encompasses the use of E2 mutants which are defective in the E2 
transcriptional activator fianction, A gene encoding such an E2 mutant is incorporated into the 
episomal vector system of the invention. 

Full-length E2 protein has two fijnctional domains which are well-conserved among the 
15 E2 proteins of different papoUomaviruses, a 200 amino acid transactivation domain at the amino 
terminus and a 90-100 amino acid carboxy terminal domain that is essential for dimerization and 
DNA binding. A flexible spacer hinge region separates the transactivation and DNA binding 
domains. 

In order to provide an E2 mutant which is defective in transcriptional activation, but which 
20 are able to supply the E2 fijnction that is necessary and sufficient for episomal persistence, point 
mutations have been made in the gene encoding this protein. These mutations are useful where 
they eliminate the transcriptional activating activity of the protein without affecting the ability of 
the resultant protein to stably maintain (MCH-MME)-containing plasmids in transfected cells. 

E2 mutants were made by selected mutation of amino acids which are believed to lie in 
25 hydrophilic domains at the protein surface, and by then changing basic or acidic amino acids to a 
small hydrophobic and/or polar residue, e.g., alanine, which has a short side chain consisting of a 
single -CH3 group, serine, which also contains a single carbon atom side chain (-CH2OH); vahne, 
leucine, threonine and isoleucine, which are hydrophobic simino acids having side chains ranging 
from 2-4 carbon-containing groups. The mutation is introduced into the E2 gene using 
30 oligonucleotide-directed mutagenesis in Ml 3 phage. It is within the scope of the invention to 
locate additional point mutations which abolish the transcription activating activity of E2 without 
deleteriously aflfecting its replication function to maintain episomal vectors according to the 
invention. 
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Manufacture of Vectors of the Invention 

The invention also features vectors which include sequences conferring replication and 
5 selection in lower eukaryotic or prokaryotic host cells in order to manufacture a useful quantity of 
vector DNA, e.g., lOOug - mg quantities. 

Additional sequences which may be present in a vector of the invention to enable 
manufacture of vector DNA include other origins of replication, e.g., a bacterial or yeast origin of 
replication, which permits preparation of vector DNA in prokaryotic or eukaryotic cells, or 
10 baculovirus sequences which allow for vector replication in insect cells. 

This aspect of the invention is applicable to most strains of bacteria, for example, gram 
positive and negative bacterial strains, and yeast. Gram negative bacteria useful according to the 
invention include but are not limited to E. coli and Salmonella , e.g., S. typhimurium. Gram 
positive species useful according to the invention include but are not limited to Bacillus and 
1 5 Lactococcus . 

Prokaryotic and Eukarvotic Origins of Replication 

The invention can be utilized advantageously with a plasmid origin of replication that 
20 permits replication of at least 10, preferably at least 20-100, and most preferably at least 200-500 
copies of the plasmid per host cell. Those origins of replication that permit replication of 
moderate (i.e., 2050) to high plasmid (i.e., 200-500) copy numbers are especially useful. Of 
course, if desired, a plasmid having a copy number, as high as 1000-2000 copies per cell, also may 
be used. 

25 Plasmids with low copy numbers (i.e., 10 copies or less) are most advantageously used 

according to the invention after mutation to bring about increased copy number (J. Scott, 1984, 
Microbial Reviews 48:1-23). Of the frequently used origins of replication, pBR322 (20 
copies/cell) is useful according to the invention, as is pUC (at 200 copies/cell). Other plasmids 
whose origins of replication may be useful according to the invention are those which require the 

30 presence of plasmid encoded proteins for replication, for example, the pT181, FII, and FI origins 
of replication. 

Examples of origins of replication which are useful according to the invention in E. coli 
and S tvphimurium include but are not limited to pMBl (25 or more copies per cell. Bolivar et 
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ai, Gene 2:95-113), ColEl (15 or more copies per cell, Kahn et ai, 1979, Methods 

Enzymol. 68:268280), pl5A (about 15 copies per cell, Chang etal, 1978, J. Bacteriol. 134:1141- 
1156); pSClOl (about 6 copies per cell, Stoker ei ai, 1982, Gene 18:335-341); R6K (less than 
15 copies per cell, Kahn et al, 1979, supra); Rl (temperature dependent origin of replication, 
5 Uhlin et al, 1983, Gene 22:255-265); lambda dv (Jackson et aL, 1972, Proc. Nat. Aca. Sci. 
69:2904-2909). An example of an origin of replication that is useful in Staphylococcus is pT181 
(about 20 copies per cell, J. Scott, 1984, Microbial Reviews 48:1-23. Of the above-described 
origins of replication, pMBl, pl5A and ColEl are preferred because these origins do not require 
plasmid-encoded proteins for replication. 

10 

Genes Encoding Selectable Marker Useful in Vectors of the Invention 

Genes encoding selectable markers useful in vectors of the invention include antibiotic 
resistance genes, for example encoding resistance to antibiotics such as ampicillin, kanamycin or 
1 5 tetracycline, are the most common dominant selectable markers used in molecular biology cloning 
and fermentation procedures for the production of recombinant proteins or plasmid DNA. 

Preparation of vector DNA can also be used in conjunction with antibiotic resistance. 
Representative antibiotic resistance genes useful according to the invention include, for example, 
the kanamycin gene, from pUC4K (Pharmacia Biotech) by restricting the plasmid with Xhol and 
20 filling in the 5* overhang. This plasmid DNA is then restricted with PstI and the fragment 
containing the kanamycin gene is then gel purified. Other useful antibiotic resistance genes are 
well-known in the art, including the genes encoding chlorainphenicol acetyl transferase, ampicillin, 
and tetracycline- 

A vector of the invention which is intended for manufacture in bacteria will include in 
25 addition to the papillomavirus replication elements described herein, for example, a bacterial 
origin of replication and a gene encoding a selectable marker. The vector DNA will be 
transformed into a bacterial host, and the host grown under selection conditions. Plasmid DNA is 
then prepared according to conventional means. 

30 Delivery of Episomal Vector to Host Cell 

Vectors of the invention may be delivered to a host cell via any one of a number of vector 
delivery means. The invention is not limited by the mode of delivery of the vector to the host cell. 
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Transfer of a vector of the invention to a host cell can be accomplished via any of the following, 
including but not limited to direct injection of naked DNA, for example, using a gene gun, 
transfection using calcium phosphate coprecipitation, fusion of the target cell with a liposomal 
vehicle, erythrocyte ghosts or spheroplasts carrying DNA, plasmid and viral vector-mediated 
5 transfer, DNA protein complex-mediated gene transfer such as receptor-mediated gene transfer, 
and viral infection. 

Receptor-mediated gene transfer is dependent upon the presence of suitable ligands on the 
surfaces of cells which will allow specific targeting to the desired cell type followed by 
internalization of the complex and expression of the DNA. One form of receptor-mediated gene 

1 0 transfer is wherein a DNA vector is conjugated to antibodies which target with a high degree of 
specificity cell-surface antigens (Wong and Huang, 1987, Proc- Nat. Aca. Sci. 84:7851; Roux 
aL, 1989, Proc. Nat. Aca. Sci. 86::9079; Trubetskoy et aL, 1992, Bioconjugate Chem. 3:323; 
and Hirsch ei ai, 1993, Transplant Proceedings 25:138). Nucleic acid may be attached to 
antibody molecules using polylysine (Wagner et ai, 1990, Proc. Nat. Aca. Sci. 87:3410; Wagner 

15 et ai, 1991, Proc. Nat. Aca. Sci. 89:7934) or via liposomes. 

Increased expression of DNA derived from ligand-DNA complexes taken up by cells via 
an endosomal route has been achieved through the inclusion of endosomal disruption agents, such 
as influenza virus hemagglutinin fusogenic peptides, either in the targeting complex or in the 
medium surrounding the target cell. The fusogenic peptide of the HA molecule is a modified form 

20 of HA which retains two important functions of HA. It allows for fusion of the targeted 
DNA/ligand complex to the cell membrane, but without the host cell sialic acid-binding specificity 
of the natural molecule. Instead, host cell binding specificity is conferred by the ligand/receptor 
interaction. The modified HA fusogenic peptide also retains the HA function of endosomal 
uptake, thus allowing for uptake of the complex into the host cell via membrane fusion, and the 

25 endosomal escape function of HA, which allows for escape of the enveloped DNA from the 
endosomai/Iysosomal destruction pathway. The fusogenic peptide may include the HA amino 
acid sequence GLFGAIAGFIGAGTGGMIAGGGC. 



30 



1 . Viral Vectors 

Recombinant viral vectors as well as other DNA transfer schemes can be used in practice 
of the present invention. A recombinant viral vector of the invention will include DNA of at least 
a portion of a viral genome which portion is capable of infecting the target cells. By "infection" is 
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generally meant the process by which a virus transfers genetic material to its host or target cell. 
Preferably, the virus used in the construction of a vector of the invention is also rendered 
replication-defective to remove the effects of viral replication on the target cells. In such cases, 
the replication-defective viral genome can be packaged by a helper virus in accordance With 
5 conventional techniques. Generally, any virus meeting the above criteria of infectiousness and 
capabilities of functional gene transfer can be employed in the practice of the invention. 

Suitable viruses for practice of the invention include but are not limited to, for example, 
papovavirus and herpesvirus, well known to those skilled in the art, suitable vector packaging cell 
lines for stably transducing mammalian cell lines are known in the art. 
10 It will be appreciated that when viral vector schemes are employed for gene transfer 

according to the invention, the use of an attenuated or a virulent virus also may be desirable. 

2. Delivery of Gene via DNA-Protein Complexes 

13 Targeted gene delivery is also achieved according to the invention using a DNA-protein 

complex. Such DNA-protein complexes include DNA complexes with a ligand that interacts with 
a target cell surface receptor. Cell surface receptors are thus utilized as naturally existing entry 
mechanisms for the specific delivery of genes to selected mammalian cells. It is known that most, 
if not all, mammalian cells possess cell surface binding sites or receptors that recognize, bind and 

20 internalize specific biological molecules, i.e., ligands. These molecules, once recognized and 
bound by the receptors, can be internalized within the target cells within membrane-limited 
vesicles via receptor-mediated endocytosis. Examples of such ligands include but are not limited 
to proteins having functional groups that are exposed sufficiently to be recognized by the cell 
receptors. The particular proteins used will vary with the target cell. 

25 Typically, glycoproteins having exposed terminal carbohydrate groups are used although 

other ligands such as antibodies or polypeptide hormones, also may be employed. Using this 
technique the phototoxic protein psoralen has been conjugated to insulin and internalized by the 
insulin receptor endocylotic pathway (Gasparro, Bio-chem. Biophys. Res. Comm. 141(2), pp 
502509, Dec. 15, 1986); the hepatocyte specific receptor for galactose terminal 

30 asialoglycoproteins has been utilized for the hepatocyte- specific transmembrane delivery of 
asialoorosomucoid-poly-L-lysine non-covalently complexed to a DNA plasmid (Wu, G.Y., J. 
Biol. Chem., 262(10), pp. 44294432, 1987); the cell receptor for epidermal growth factor has 
been utilized to deliver polynucleotides covalently linked to EGF to the cell interior (Myers, 
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European Patent Application 86810614.7, published Jun. 6, 1988); the intestinally situated 
cellular receptor for the organometallic vitamin Bi2-intrinsic factor complex has been used to 
mediate delivery to the circulatory system of a vertebrate host a drug, hormone, bioactive peptide 
or immunogen complexed with vitamin Bn and delivered to the intestine through oral 
5 administration (Russel- Jones et ai, European patent Application 863 07849.9, published Apr. 29, 
1987); the marmose-6-phosphate receptor has been used to deliver low density lipoproteins to 
cells (Murray. G. J. and Neville, D. M., Jr., J. Bio. Chem. Vol. 225 (24), pp. 1 194-1 1948, 1980); 
the cholera toxin binding subunit receptor has been used to deliver insulin to cells lacking insulin 
receptors (Roth and Maddox, J. Cell. Phys. Vol. 115, p. 151, .1983); and the human chorionic 

1 0 gonadotropin receptor has been employed to deliver a ricin a-chain coupled to HCG to cells wdth 
the appropriate HCG receptor in order to kill the cells (Oeltmann and Heath, J. Biol. Chem, Vol 
254, p. 1028 (1979)). Ligands selected from biotin, biotin analogs and biotin receptor-binding 
ligands, and/or folic acid, folate analogs and folate receptor-binding ligands to initiate receptor 
mediated transmembrane transport of the ligand complex, as described in U.S. Pat. No. 

15 5,108,921. 

Generally, a ligand is chemically conjugated by covalent, ionic or hydrogen bonding to the 
nucleic acid. A ligand for a cell surface receptor may be conjugated to a polycation such as 
polylysine with ethylidene diamino carbodiimide as described in U.S. Patent No. 5,166,320. DNA 
may be attached to an appropriate ligand in such a way that the combination thereof or complex 
20 remains soluble, is recognized by the receptor and is internalized by the cell. The DNA is carried 
along with the ligand into the cell, and is then expressed in the cell. The protein conjugate is 
complexed to DNA of a transfection vector by mixing equal mass quantities of protein conjugate 
and DNA in 0.25 molar sodium chloride. The DNA/protein complex is taken up by cells and the 
gene is expressed. 

25 Delivery of the foreign DNA into the target cell may also be achieved via the DNA 

construct* s association with an endosomal disruption agent, such as the influenza hemagglutinin 
fusogenic peptide, as described above. 

3 . Liposomal Gene Transfer 

30 

Liposomes have been used for non-viral delivery of many substances, including nucleic 
acids, viral particles, Jind drugs. A number of reviews have described studies of liposome 
production methodology and properties, their use as carriers for therapeutic agents and their 
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interaction with a variety of cell types. See, for example, "Liposomes as Dmg Carriers," Wiley 
and Sons, NY (1988), and "Liposomes from Biophysics to Therapeutics." Marcel Dekker, NY 
(1987). Several methods have been used for liposomal delivery of DNA into cells, including poly- 
L-lysine conjugated lipids (Zhou ei ai, Biochim. Biophys. Acta. 1065:8-14, 1991), pH sensitive 
5 immunoiiposomes (Gregoriadis, G., Liposome Technology, Vol I, 11, HI, CRC, 1993), and 
cationic liposomes (Feigner etai, Proc. Natl. Acad. Sci., USA, 84:7413-7417, 1987). Positively 
charged liposomes have been used for transfer of heterologous genes into eukaryotic cells 
(Feigner a/., 1987, Proc. Nat. Aca. Sci, 84:7413; Rose a/., 1991, BioTechniques 10:520). 
Cationic liposomes spontaneously complex with plasmid DNA or RNA in solution and facilitate 

10 fusion of the complex with cells in cuhure, resulting in delivery of nucleic acid to the cell. Philip 
et al. 1994, Mol. and Cell. Biol. 14:241 1, report the use of cationic liposomes to facilitate adeno- 
associated virus (AAV) plasmid transfection of primary T lymphocytes and cultured tumor cells. 

Delivery of an agent using liposomes allows for non-invasive treatment of diseases. 
Targeting of an organ or tissue type may be made more efficient using immunoiiposomes, i.e., 

15 liposomes which are conjugated to an antibody specific for an organ-specific or tissue-specific 
antigen. Thus, one approach to targeted DNA delivery is the use of loaded liposomes that have 
been made target-specific by incorporation of specific antibodies on the liposome surface. 

Host Cells Useful in the Invention 

20 

The cells targeted for in vivo or ex vivo gene transfer in accordance with the invention 
include any cells to which the delivery of the therapeutic gene is desired. Eukaryotic cells are 
preferred, and particularly mammalian cells. For example, brain cells, cells of the central nervous 
system, nerve cells, gut cells, skin cells, kidney cells, endothelium, lung cells, liver cells, cells of 
25 the immune system such as T-cells, B-cells, and macrophages. 

Dosage, Mode of Administration and Pharmaceutical Formulation 

Vector DNA may be formulated as naked DNA for parenteral administration or as a 
30 transfection mixture. In the latter case, the transfection mixture may be assembled just prior to 
use. In the case of a pharmaceutical composition, the vector DNA includes the papillomavirus 
MO and MME sequences, optionally, the El and E2 genes, and a gene whose expression is 
intended to have some beneficial therapeutic effect on the cells of the recipient. For optimal 
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efficiency of delivery of naked DNA to a target tissue, it is preferred that the vector DNA be 
delivered at 10-lOOug/l 0,000 cells, optimally about 50ug/10,000 cells. 

The DNA may be exchanged into isotonic phosphate free buffer and sterile filtered, and 
then aliquotted into suitable vials. The vials may be stored at 4°C, 20^C or SO^C or alternatively 
5 the mixture may be freeze dried from a buffer containing an appropriate carrier and bulking agent. 
In these cases, the dosage form is reconstituted with a sterile solution before administration. 

For delivery of vector DNA in vivo or ex vivo , the vector containing a gene of 
physiological importance, such as replacement of a defective gene or an additional potentially 
beneficial gene function, is expected to confer long term genetic modification of the cells and be 
10 effective in the treatment of disease. 

For example, a patient that is subject to a viral or genetic disease, or who is being 
vaccinated against a virus or pathogen, may be treated in accordance with the invention via in 
vivo or ex vivo methods. For example in vivo treatments, a vector of the invention can be 
administered to the patient, preferably in a biologically compatible solution or a pharmaceutically 
1 5 acceptable delivery vehicle, by ingestion, injection, inhalation or any number of other methods. 
The dosages administered will vary from gene to gene, disease to disease, and from patient to 
patient; a "therapeutically effective dose" will be determined by the level of enhancement of 
function of the transferred genetic material balanced against any risk or deleterious side effects. 
Monitoring levels of gene introduction, gene expression and/or the presence or levels of the 
20 encoded therapeutic protein will assist in selecting and adjusting the dosages administered. 
Generally, a composition including the vector will be administered in a single dose in the range of 
10 ng - 1 mg/kg body weight, preferably in the range of 100 ng - 100 ug/kg body weight, such 
that at least one copy of the therapeutic gene is delivered to each target cell Vector DNA may be 
administered as a single dose, or in multiple doses, as needed. The therapeutic gene will, of 
25 course, be associated with appropriate regulatory sequences for expression of the gene in the 
target cell. 

Ex vivo treatment is also contemplated within the present invention. Cell populations can 
be removed from the patient or otherwise provided, transduced with a therapeutic gene in 
accordance with the invention, then reintroduced into the patient. In general, ex vivo cell dosages 
30 will be determined according to the desired therapeutic effect balanced against any deleterious 
side-effects. Such dosages will usually be in the range of 10^- 10* cells per patient, daily weekly, 
or intermittently; preferably 10^- 10^ cells per patient. 
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EXAMPLES 

The following examples describe detailed embodiments of the invention in which a given gene of 
interest is introduced and stably maintained in a specific host cell, or a specific disease is 
5 exemplified for treatment according to the invention. 

EXAMPLE 1 

Recombinant Host Cells of the Invention 

10 

Recombinant vectors of the invention may be used in the in vitro production of a protein 
of interest, for example, cell factories may be prepared by transforming a host cell with a 
recombinant vector of the invention. The vector will contain the papillomavirus MO, MME and a 
gene encoding the protein of interest, and may optionally include the genes encoding the HP V or 

15 BPV El and E2 genes. Alternatively, a cell line may be prepared that carries in its chromosome 
the El and E2 genes, such that the encoded proteins are in trans to the papilloma episomal 
sequences. A cell line carrying a recombinant vector of the invention, whether it carries the El 
and E2 genes in cis or in trans to the vector permits stable, high level expression of proteins of 
therapeutic value in cultured mammalian cells. 

20 Where it is desirable according to the invention to determine the copy number (per haploid 

or diploid host cell genome) of a recombinant vector of the invention. Vector copy number may 
be determined as described herein for copy number determination of the BPV origin-containing 
plasmids. 

Described below is an example in which a host cell line is engineered to contain 
25 chromosomal copies of the El and E2 genes. This cell line is advantageous for production of a 
desired protein or RNA in that it can be transfected with a vector of the invention encoding the 
desired protein or RNA, and cultured to produce that molecule. 

We constructed several cell lines constitutively expressing the El and E2 proteins. 
Expression of these proteins was directed fi-om integrated constructs for El protein from CN4V 
30 promoter (cell line CH0212) and for E2 protein firom HAP70 promoter (cell line CH049). In the 
cell line CH04.15 which expresses both El and E2, the El protein was expressed from 
SRapromoter and the E2 protein from HSP70 promoter. Selection of the respective cell lines and 
amplification of the expression units of interest was achieved by utilizing the glutamine synthetase 



BNSDOCID:<W0 9724451 A2> 



SUBSTITUTE SHEET (RULE 26) 



wo 97/24451 




PCT/EE96/00004 



minigene from the pSVLGS.l plasmid according to the protocol described earlier (Bebbington 
and Hentschel, 1987). Expression of El and E2 was identified by immunoprecipitation using 
specific rabbit polyclonal sera (data not shown) and by in vivo replication assays. The three cell 
lines and the parental CHO cells were transfected with the BPV-1 origin containing plasmid 
5 pUC/Alu in combination with El and E2 expression vectors. The cell line CH04J5 which 
expresses both El and E2, supports replication of the origin plasmids in the absence of erogenous 
El and E2. The E2 expressing cell line, CH049, supports replication in the presence of an El 
expression vector, but fails to do so without erogenous EL The El expressing cell hne, 
CH02I2, supports replication only in the presence of an E2 expression vector. In the parental 

10 CHO cell line, co-expression of both El and E2 is required for replication. No replication of 
pUC/Alu can be detected in the absence of El and E2. 

The cell line CH04.15 is transformed with a vector of the invention, for example, the 
pBN/TKEl-0 vector shown in detail in Fig. 7B. Alternatively, where transient high level 
expression of the gene of interest is desired, the vector pBNEl, shown in Fig. 7 A may be used to 

15 produce relatively higher quantities of the protein of interest. We have observed MO/MME 
vector persistence in CH04.15 cells for at least 80 cell generations with a copy number loss less 
than 10% over 80 generations. Each of these vectors contains the beta-galactosidase reporter 
gene. However, as is described below, the beta-gal gene can be removed and substituted with a 
gene of interest. The transformed cells are grown under conditions, as described in Materials and 

20 Methods, to produce the encoded molecule of interest. 

EXAMPLE 2 

E2 Transcriptional Activator Mutants 

25 

The invention also encompasses the use of E2 point mutants which are defective in the E2 
transcriptional activator function and competent in the replication function. Such E2 mutants are 
incorporated into the episomal vector system of the invention. Point mutations have been made in 
the gene encoding this protein which eliminate the transcriptional activating activity of the protein 
30 without affecting the ability of the resultant protein to stably maintain (MO+MME)-containing 
plasmids in transfected cells. 

The mutants were made by selected mutation of amino acids which are believed to lie in 
hydrophilic domains (i.e., hydrophilic domains of between about 4-15 amino acids) and/or alpha 
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helical domains at the protein surface. Such mutants are made, for example, by amino acid 
substitutions in the alpha helix region 2 (amino acid residues 23-49) and helix 3 (amino acids 
residues 62-82), and by then changing basic or acidic amino acids to a small hydrophobic residue. 
The mutation is introduced into the E2 gene using oligonucleotide-directed mutagenesis in M 13 
5 phage. It is within the scope of the invention to locate additional point mutations which abolish 
the transcription activating activity of E2 without deleteriously affecting its replication function. 

Figure 14A depicts the location of the mutations and schematically illustrates the predicted 
secondary structure (cylinder - -helix, shaded box - -sheet) for the N-terminal and the X-ray 
structure for the C-terminal portions of the E2 protein. The locations of mutations are indicated 
10 by arrows, numbers refer to the position of the mutated residue in the E2 protein. 

Figure 14B demonstrates that mutant E2 proteins are stably expressed in cells. 
Immunoblots of the E2 proteins are shown. Cell lysates were prepared from COS-7 cells 
containing a wild-type (WT) or mutant E2 expression plasmid. 30]j\ of cell lysate was loaded on 
each lane. The negative control lysate was prepared from cells electroporated with carrier DNA. 
15 The substitutions of amino acids are shown on the lettering, and single amino acid code is used; 
the numbers refer to the positions of the substituted residues in the E2 protein. 

Figure 15A shows the replication activation properties of the mutant E2 proteins using 
transient replication assays of the mutated E2 proteins in CHO cells. Cells were electroporated 
with 100 ng of reporter plasmid pUC/Alu, 500 ng of pCGEag and 250 ng of pCGE2, which 
20 expresses wild-type E2 or its derivatives. Cells were harvested either 36 or 48 hours after 
electroporation, low mol.wt. DNA was digested with Dpnl and linearizing enzyme Hind III and 
analyzed by the Southern blotting. 

Figure 15B shows the reporter constructs for determining transcription activation 
properties of the E2 mutants, and origin constructs for determining activation of replication. The 
25 numbers indicate nucleotide positions in the BPV URR sequence. pUCAlu was used for transient 
replication studies. pPCAT and pSV3BS9CAT are the CAT reporter plasmids used in 
transcriptional activation assays. 

Figure 16 shows the transcription activation activity of the mutant E2 proteins and 
represents comparison of transactivation and DNA binding abilities of E2 protein mutants. The 
30 radioactive signals of gel-shif\ assays were quantitated with the use of a Phosphorlmager (data 
not shown). After scanning, the E2 specific signal of wtE2 protein was set as 1 For transcription 
assay CHO cells were electroporated with 250 ng of respective reporter and 250 ng of pCGE2 or 
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its derivatives. Normalized CAT activities were determined 40 hours after transfection. In all 
cases the values shown represent the results of three independent transfection experiments. 

Figure 1 7 demonstrates of the self-squelching feature of some of the E2 mutants at the 
higher concentrations. Transient transcription assay for the E2 protein mutants were performed 
5 as described in Materials and Methods. CHO cells were transfected with increasing amounts of 
the expression vector for E2 mutants or chimeric E2 proteins, as indicated, and with 250 ng of 
reporter plasmid pSV3BS9CAT. Normalized CAT activities were determined 40 hours after 
transfection. Each point represents the result obtained in three independent transfection assays. 
Fig. A. Mutations with nearly wild-type properties in transient transcription assay B. Mutations, 
10 which transcriptional activity has decreased to 50% of that of the wild-type protein C. Inactive 
mutations for transcription. D. Transcriptional properties of chimeric proteins p53:E2 and 
VP16:E2 

The mutants R37A (i.e., Arg-37">Ala), E74A (Glutamic Acid-74 ">Ala), D122A, 
D143A/R172C are particularly useful according to the invention because they are defective or 
1 5 crippled for transcriptional activation of promoters are essentially wild type for activation of 
replication in the transient replication assay. 

Table 1 shows codons which were mutated, giving rise to the described amino acid changes. 
Table 1 

20 



codon 


wt 


mutated 


change 


name 


start 


codon 


codon 






2716 


AGA 


GCA 


Arg-37 Ala 


R37A 


2827 


GAA 


GCA 


Glu-74 Ala 


E74A 


3121 


CGC 


TGC 


Arg-172 Cys 


R172C 



E2 mutants such as those described herein can be tested for their ability to replicate 
MO+MME containing plasmids using the MO/MME replication assays described herein. The use 
of such E2 mutants is advantageous in that it eliminates, or at least minimizes, the possibility of 
25 aberrant activation of cellular genes including proto- oncogenes by the E2 protein, and represents 
an improved safety feature of this system. The safety of the vector is increased by the inactivation 
of the E2 transcription activation domain. Furthermore, the specificity of the vector and also its 
safety is increased because the mutant form of E2 would not activate E2 responsive promoters 
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which may be present on the vector. In addition, the mutant form of E2 described herein would 
not transactivate E2 enhancers. It is known that some E2 binding sites constitute an E2 enhancer. 
These binding sites, when present on a vector according to the invention as part of the iMME, 
would not be transactivated by mutant E2, thus further reducing the deleterious effects of wild- 
5 type E2 in this vector system. Use of the mutant form of E2 disclosed herein also may improve 
the specificity of tissue-specific promoters or other promoters which drive expression of a 
therapeutic (or other) gene on the vector. That is, because promoters may have some leakiness, 
transcription fi"om a leaky promoter driving E2 gene expression may result in E2 expression and 
initiation of E2 transactivation of an E2-driven promoter and gene. However, the inventive 
10 mutant form of E2 is defective in transactivation and therefore would not initiate E2-responsive 
transcription even in the presence of a leaky promoter driving mutant E2 gene expression. 
Therefore, a tissue-specific promoter which drives expression of a gene of interest on a vector of 
the invention may be more tightly regulated as a resuh of use of an E2 mutant according to the 
invention. 

15 

EXAMPLE 3 

Stable persistence of Vector of the Invention in Muscle Cells 

20 The usefulness of the EI/E2/M0/MME episomal vector system for in vivo deliver^' to, and 

long term expression in, manimalian muscle is demonstrated as follows. Generation of a humoral 
immune response to genes expressed in these vectors has been achieved by expression in muscle 
by direct intramuscular injection of vector DNA, testifying to its utility as an in vivo vaccine 
system. Briefly, the E1/E2/M0/MME plasmid is directly injected into mice intramuscularly into 

25 each leg; 50ul saline containing lOOug plasmid DNA is injected. into the quadraceps muscle of 
each leg. Injections were performed in 1, 2, or 3 sets at 3 week intervals ( Ulmer, 1993, Science 
259;1745). Fig. 19 shows the results of the experiment. 0.1 mg of plasmid DNA was injected into 
mice intramuscularly. Serum samples were collected at 90 days after injection. Antibodies were 
measured by ELISA using recombinant 3-galactosidase as antigen. The OD at 414 was read 

30 using a 1: 1000 dilution of serum. "Neg. Control" refers to saline alone injection, "pUE83" refers 
to an RSV LTR promoter-driven p-gal expression cartridge cloned into pUC19; '* SRalpha 
vector" refers to the E1/E2/M0/MME plasmid containing the El and E2 genes driven by the 
SRalpha promoter and the same p-gal expression cartridge; and "TK vector" refers to the El 
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/E2/M0/MME plasmid containing the El and E2 genes driven by the TK promoter and the same 
3-gal expression cartridge. The MME in the SRalpha and TK vectors is the region in the BPV 
URR corresponding to positions 6958-1. 

5 EXAMPLE 4 

Stable persistence of Vector of the Invention in Brain Cells 

The follovy^ng example demonstrates persistent, long term expression of lacZ reporter 
10 gene in the central nervous system using an E1/E2/M0/MME vector harboring a lacZ reporter 
gene (Figure 18). Briefly, the vector SR-alpha #2 (SR promoter directing El and E2 
expression in the E1/E2/M0/MME plasmid described in detail above) was injected into an adult 
rat brain striatum region (bregma +1.7 dex. +2.5). Total amount of vector DNA was 50 
micrograms in 5 microliter of PBS. Plasmid DNA was dissolved in PBS (panels E and F) or 
1 5 artificial cerebrospinal fluid (panels A-D). The resuUs showed that P -galactosidase activity was 
detected histochemically by X-GAL regular staining protocol on frozen brain tissue slices after 5 
days (panels A and B), two weeks (panels C and D) and three months (panels E and F) exposure. 
Control animals were injected with PBS (E) or CSF (panels A and C). Therefore, the plasmid 
was stably maintained in brain tissue for as long as three months after introduction. 
20 The above example demonstrates the utility of vectors of the invention in gene therapy and 

in providing animal models of disease and therapy. 
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OTHER EMBODIMENTS 

5 

Other embodiments will be evident to those of skill in the art. It should be understood 

that 

the foregoing detailed description is provided for clarity only and is merely exemplary. The spirit 
and scope of the present invention are not limited to the above examples, but are encompassed by 
10 the following claims. 
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Claims 

1 . A method of obtaining long-term stable production of a gene product of interest in a host 
cell, comprising 
5 providing a host cell containing a vector comprising 

(A) a minimal origin of replication of a papilloma virus, 

(B) a minichromosomal maintenance element of a papilloma virus, and 

(C) a gene encoding said gene product, 

wherein said vector, when present in a mammalian host cell, persists in said cell for at least 
10 about 50 cell generations in dividing cells or for at least about 8 weeks in non-dividing cells under 
nonselective conditions without an appreciable loss of copy number. 



2. A method of obtaining long-term stable production of a gene product of interest in a host 
cell, comprising 

15 providing a host cell containing a vector comprising papillomavirus sequences consisting 

essentially of 

(A) a papillomavirus El gene and E2 gene, 

(B) a minimal origin of replication of a papilloma virus, 

(C) a minichromosomal maintenance element of a papilloma virus, and 
20 (D) a gene encoding said gene product, 

wherein said vector persists in said cell for at least about 50 cell generations in dividing cells 
or for at least about 8 weeks in non-dividing cells under nonselective conditions without an 
appreciable loss of copy number. 

25 3 . A method of obtaining long-term stable production of a gene product of interest in a host 

cell, comprising 

providing a host cell containing a pair of vectors comprising 

(I) a first vector comprising papillomavirus sequences consisting essentially of 

(A) a papillomavirus El and E2 gene, 
30 (B) a minimal origin of replication of a papilloma virus, and 

(C) a minichromosomal maintenance element of a papilloma virus, and 

(II) a second vector comprising papillomavirus sequences consisting essentially of 
(A) a gene encoding said gene product, 
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(B) a minimal origin of replication of a papilloma virus, and 
(C) a minichromosomal maintenance element of a papilloma virus, 

wherein said vector persists in said cell for at least about 50 cell generations in dividing cells 
or for at least about 8 weeks in non-dividing cells under nonselective conditions without an 
5 appreciable loss of copy number. 

4. Use of a recombinant vector for obtaining long term stable maintenance of erogenous 
DNA in a eukaryotic host cell wherein the recombinant vector comprises: 
minimal origin of replication of a papillomavirus; 
10 minichromosomal maintenance element of a papillomavirus, and 

heterologous DNA sequence encoding an expressible gene. 



5. The method of claims 1-3 or the use of claim 4, said minichromosomal maintenance 
element of papillomavirus being from BPV. 

15 

6. The method of claims 1-3 or the use of claim 4, said minimal origin of replication of 
papillomavirus being from BPV. 

7. The method of claim 1 or the use of claim 4, further comprising a papillomavirus gene 
20 encoding £2. 

8. The method of claims 1-3 or the use of claim 4, further comprising a papillomavirus gene 
encoding EL 

25 9. The method or use of claim 7 or 8 wherein said papillomavirus gene encoding El or E2 

comprises a structural gene encoding El or E2 operatively associated with regulatory sequences 
for expression of the structural gene in a host cell. 

10. The method or use of claim 9, said regulatory sequences comprising a promoter. 

30 

1 1. The method or use of claim 10, said promoter comprising a promoter that is non-native 
to said structural gene. 
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12. The method or use of claim 11, said promoter being functional in more than a single 
tissue type. 

13. The method or use of claim 11, said promoter being functionally restricted to a single 
5 tissue type. 

14. The method or use of claim 11, said promoter comprising one of the thymidine kinase 
promoter and the SR alpha promoter, 

10 15. The method or use of claim 1 1, said promoter being a strong promoter. 

16. The method of claims 1-3 or the use of claim 4, said vector further comprising a bacterial 
host cell origin of replication. 

15 17. The method of claims 1-3 or the use of claim 4, said vector further comprising a gene 

encoding a selectable marker. 

18. The method of claims 1-3 or use of claim 4 wherein said gene is operatively associated 
with regulatory sequences for expression of the gene in a host cell. 

20 

19. The method or use of claim 18, said regulatory sequences comprising a promoter. 

20. The method or use of claim 19, said promoter comprising a promoter that is non-native 
to said structural gene. 

25 

21. The method or use of claim 20, said promoter being functional in more than a single cell 
type. 

22. The method or use of claim 20, said promoter being functionally restricted to a single cell 
30 type. 

23. The method or use of claim 7 wherein said gene encodes a mutant form of E2 which is 
replication competent but defective in transcriptional activation wherein said mutant form of E2 
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protein differs from the wild-type E2 in a nucleotide point mutation which translates into an 
amino acid substitution. 

24. The method or use of claim 23 wherein said gene encoding said mutant form of E2 is 
operatively associated with a cell-type-restricted promoter. 

25. The method or use of claim 23 wherein said gene encoding said mutant form of E2 is in 
trans to said minimal maintenance element and operatively associated with a cell-type-restricted 
promoter. 

26. The method or use of claim 23, said amino acid substitution being R37A. 

27. The method or use of claim 23, said amino acid being E74A. 

28. The method or use of claim 23, said amino acid being D 122 A and D143A/R172C. 

29. The method of claims 1-3 or the use of claim 4 wherein the minichromosomal 
maintenance element and the minimal origin of replication consists of a DNA sequence that is 
different from the natural papillomavirus sequence. 

30. The method of claims 1-3 or the use of claim 4 wherein the minichromosomal 
maintenance element and the minimal origin of replication are separated by a distance of less than 
about 1.0 kb. 

31. The method of claims 1-3 or the use of claim 4 wherein the minichromosomal 
maintenance element consists essentially of the region of BPV mapping to positions 7590 to 7673. 

32. The method of claims 1-3 or the use of claim 4 wherein the minichromosomal 
maintenance element comprises (BPV E2 binding sites 6, 7 and 8) x, wherein x is 3 to 6. 

33. The method of claims 1-3 or the use of claim 4 wherein the minichromosomal 
maintenance element comprises at least 2 of the 3 E2 binding sites 6, 7 and 8. 
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34. A vector for use in any one of claims 30-33. 

35. A recombinant vector for stable maintenance of erogenous DNA in a eukaryotic host 
cell, the vector comprising 

5 papilloma virus sequences consisting essentially of 

(A) a minimal origin of replication of a papilloma virus, 

(B) a minichromosomai maintenance element of a papilloma virus consisting 
essentially of at least two of the three E2 binding sites 6, 7, and 8, 

wherein the region of the vector comprising the minimal origin of replication and 
10 minichromosomai maintenance element consists of a DNA sequence different from the natural 
papilloma virus sequence, and 

wherein said vector, when present in a mammalian host cell which expresses El and E2, 
persists in said cell for at least about 50 cell generations in dividing cells or for at least about 8 
weeks in non-dividing cells under nonselective conditions without an appreciable loss of copy 
1 5 number. 

36. A recombinant vector for stable maintenance of erogenous DNA in a eukaryotic host 
cell, the vector comprising 

papilloma virus sequences consisting essentially of 
20 (A) a minimal origin of replication of a papilloma virus, and 

(B) a minichromosomai maintenance element of a papilloma virus consisting 
essentially of multiple E2 binding sites, 

wherein the distance between said minimal origin of replication and said minichromosomai 
maintenance element is less than about 1.0 kb, 
25 wherein said vector, when present in a mammalian host cell which expresses El and E2, 

persists in said cell for at least about 50 cell generations in dividing cells or for at least about 8 
weeks in non-dividing cells under nonselective conditions without an appreciable loss of copy 
number. 

30 37. A recombinant vector for stable maintenance of erogenous DNA in a eukaryotic host 

cell, the vector comprising 

papilloma virus sequences consisting essentially of 

(A) a minimal origin of replication of a papilloma virus, 
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(B) a minichromosomal maintenance element of a papilloma vims consisting 
essentiaJly of the region of BPV mapping to about positions 7590-7673 

wherein said vector, when present in a mammalian host cell which expresses El and E2, 
persists in said cell for at least about 50 cell generations in dividing cells or for at least about 8 
5 weeks in non-dividing cells under nonselective conditions without an appreciable loss of copy 
number. 

38. A recombinant vector for stable maintenance of erogenous DNA in a eukaryotic host 
ceil, the vector comprising 

10 papilloma virus sequences consisting essentially of 

(A) a minimal origin of replication of a papilloma virus, and 

(B) a minichromosomal maintenance element of a papilloma virus consisting 
essentially of (BPV E2 binding sites 6, 7, and 8)x wherein x is 3 - 6, 

wherein said vector, when present in a mammalian host cell which expresses El and E2, 
15 persists in said cell for at least about 50 cell generations in dividing cells or for at least about 8 
weeks in non-dividing cells under nonselective conditions without an appreciable loss of copy 
number. 

39. The recombinant vector of claims 35-38, said minichromosomal maintenance element of 
20 papillomavirus being from BPV. 

40. The recombinant vector of claims 35-38, said minimal origin of replication of 
papillomavirus being from BPV. 

25 41. The recombinant vector of claims 35-38, further comprising an expressible gene of 

interest. 

42. The recombinant vector of claims 35-38, further comprising a papillomavirus gene 
encoding El. 

30 

43. The recombinant vector of claims 35-38, further comprising an papillomavirus gene 
encoding E2. 
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44. The recombinant vector of claim 42 or 43 wherein said papillomavirus gene encoding El 
or E2 comprises a structural gene encoding El or E2 operatively associated with regulatory 
sequences for expression of the structural gene in a host cell. 

5 45. The recombinant vector of claim 44, said regulatory sequences comprising a promoter. 

46. The recombinant vector of claim 45, said promoter comprising a promoter that is non- 
native to said structural gene. 

10 47. The recombinant vector of claim 46, said promoter being functional in more than a single 

cell type. 

48. The recombinant vector of claim 46, said promoter being functionally restricted to a 
single cell type. 

15 

49. The recombinant vector of claim 46, said promoter comprising one of the thymidine 
kinase promoter and the SR alpha promoter. 

50. The recombinant vector of claim 46, said promoter being a strong promoter. 

20 

5 1 . Therecombinantvectorofclaims3 5-3 8,furthercomprisingabacterialhostcellorigin of 
replication. 

52. The recombinant vector of claim 51, further comprising a gene encoding a selectable 
25 marker. 

53. The recombinant vector of claim 41 wherein said gene comprises a structural gene 
operatively associated with regulatory sequences for expression of the structural gene in a host 
cell. 

30 , 

54. The recombinant vector of claim 53, said regulatory sequences comprising a promoter. 
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55. The recombinant vector of claim 54, said promoter comprising a promoter that is non- 
native to said structural gene. 

56. The recombinant vector of claim 55, said promoter being functional in more than a single 
5 cell type. 

57. The recombinant vector of claim 55, said promoter being functionally restricted to a 
single cell type. 

10 58. The recombinant vector of claim 43 wherein said gene encodes a mutant form of E2 

which is replication competent but defective in transcriptional activation, wherein the mutation is 
a point mutation. 

59. The recombinant vector of claim 58 wherein said gene encoding said mutant form of E2 
15 is operatively associated with a cell-type restricted promoter, 

60. The recombinant vector of claim 58 wherein said gene encoding said mutant form of E2 
is in trans to said minimal maintenance element and operatively associated with a cell-type- 
restricted promoter. 

20 

61 . A host cell containing the vector of claims 35-60. 

62. The host cell of claim 61, said cell being mammalian. 

25 63. The host cell of claim 61, said mammalian cell being muscle or brain. 

64, A method of obtaining stable long-term expression of a gene of interest in a cell, 
comprising providing the host cell of claim 61. 

30 65. A method of obtaining stable expression of a gene of interest in a cell, comprising 

introducing into a host cell the recombinant vector of claims 35-38. 
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66. A method of treating a disease stemming from a genetic defect, comprising administering 
a therapeutically effective amount the vector of claims 35 -38 to a patient afflicted with said 
disease. 

5 67. A method of manufacture of vector DNA, comprising culturing the host cell of claim 61. 

68. A method of producing a protein of interest in a host cell, comprising culturing the host 
cell of claim 61 under conditions which permit expression of said gene of interest. 

10 69. A method of producing a protein in a transgenic animal, comprising providing a 

transgenic animal containing the vector of claim 41 which produces the protein of interest. 

70. A mutant form of a papillomavirus E2 protein wherein the replication function of said 
protein is competent and the transcriptional activation function of said protein is defective, 

1 5 wherein said mutant form of E2 protein differs from the wild-type E2 in a nucleotide point 
mutation which translates into an amino acid substitution. 

71. The mutant form of a papillomavirus E2 protein of claim 70, said amino acid substitution 
being R37A. 

20 

72. The mutant form of a papillomavirus E2 protein of claim 70, said amino acid substitution 
being E74A. 

73. The mutant form of a papillomavirus E2 protein of claim 70, said amino acid substitution 
25 being D 122 A and D 143A/R172C. 

74. A gene encoding the mutant form of the papillomavirus E2 protein of claim 70. 
75 A host cell transformed with the gene of claim 74. 

30 

76. A kit for providing stable persistence of a vector in a host cell, the kit comprising the 
vector of claims 35-38 and packaging materials therefor. 
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77. A kit for providing a stably transformed host cell to a patient, the kit comprising the host 
cell of claim 61 and packaging materials therefor. 

78. A kit for providing a mutant E2 protein to a host cell for stable persistence of a vector in 
5 the host cell, the kit comprising the mutant E2 protein of claim 70, and packaging materials 

therefor. 

79. A kit for providing a mutant E2 protein to a host cell for stable persistence of a vector in 
the host cell, the kit comprising the gene of claim 74, and packaging materials therefor. 



80. A mammalian disease model comprising a transgenic animal whose celts contain the 
vector of claim 4 1 . 



10 
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2600 2610 2620 2630 2640 

I I I I I 

GTG AAG AGG ATG GAG ACA GCA TGC GAA CGT TTA CAT GTA GCG CAA 
Val Lys Arg MET Glu Thr Ala Cys Glu Arg Leu His Val Ala Gin 

1 

2650 2660 2670 2680 

I I I I 

GAA ACA CAA ATG CAG TTG ATT GAG AAA AGT AGT GAT AAG TTG CAA 

Glu Thr Gin MET Gin Leu lie Glu Lys Ser Ser Asp Lys Leu Gin 
13 

2690 2700 2710 2720 2730 

I I I ** I I 

GAT CAT ATA CTG TAC TGG ACT GCT GTT AGA ACT GAG AAC ACA CTG 
Asp His lie Leu Tyr Trp Thr Ala Val Arg Thr Glu Asn Thr Leu 
28 

2740 2750 2760 2770 

I I I I 

CTT TAT GCT GCA AGG AAA AAA GGG GTG ACT GTC CTA GGA CAC TGC 

Leu Tyr Ala Ala Arg Lys Lys Gly Val Thr Val Leu Gly His Cys 

43 

2780 2790 2800 2810 2820 

I I I I I 
AGA GTA CCA CAC TCT GTA GTT TGT CAA GAG AGA GCC AAG CAG GCC 
Arg Val Pro His Ser Val Val Cys Gin Glu Arg Ala Lys Gin Ala 
58 

2830 2840 2850 2860 

* I I I I 

ATT GAA ATG CAG TTG TCT TTG CAG GAG TTA AGC AAA ACT GAG TTT 

lie Glu MET Gin Leu Ser Leu Gin Glu Leu Ser Lys Thr Glu Phe 

73 

2870 2880 2890 2900 2910 

II III 
GGG GAT GAA CCA TGG TCT TTG CTT GAC ACA AGC TGG GAC CGA TAT 
Gly Asp Glu Pro Trp Ser Leu Leu Asp Thr Ser Trp Asp Arg Tyr 
88 

2920 2930 2940 2950 

I I I I 

ATG TCA GAA CCT AAA CGG TGC TTT AAG AAA GGC GCC AGG GTG GTA 

MET Ser Glu Pro Lys Arg Cys Phe Lys Lys Gly Ala Arg Val Val 

103 
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2960 2970 2980 2990 3000 

I I I I I 

GAG GTG GAG TTT GAT GGA AAT GCA AGC AAT ACA AAC TGG TAC ACT 
Glu Val Glu Phe Asp Gly Asn Ala Ser Asn Thr Asn Trp Tyr Thr 
118 

3010 3020 3030 3040 

I I I I 

GTC TAC AGC AAT TTG TAC ATG CGC ACA GAG GAC GGC TGG CAG CTT 
Val Tyr Ser Asn Leu Tyr MET Arg Thr Glu Asp Gly Trp Gin Leu 
133 

3050 3060 3070 3080 3090 

I I i I I 

GCG AAG GCT GGG GCT GAC GGA ACT GGG CTC TAC TAC TGC ACC ATG 

Ala Lys Ala Gly Ala Asp Gly Thr Gly Leu Tyr Tyr Cys Thr MET 

148 

3100 3110 3120 3130 

I I I * I 

GCC GGT GCT GGA CGC ATT TAC TAT TCT CGC TTT GGT GAC GAG GCA 
Ala Gly Ala Gly Arg lie Tyr Tyr Ser Arg Phe Gly Asp Glu Ala 
163 

3140 3150 3160 3170 3180 

I I I I i 

GCC AGA TTT AGT ACA ACA GGG CAT TAC TCT GTA AGA GAT CAG GAC 

Ala Arg Phe Ser Thr Thr Gly His Tyr Ser Val Arg Asp Gin Asp 

178 

3190 3200 3210 3220 

III I 
AGA GTG TAT GCT GGT GTC TCA TCC ACC TCT TCT GAT TTT AGA GAT 

Arg Val Tyr Ala Gly Val Ser Ser Thr Ser Ser Asp Phe Arg Asp 

193 



3230 3240 3250 3260 3270 

I I I I I 

CGC CCA GAC GGA GTC TGG GTC GCA TCC GAA GGA CCT GAA GGA GAC 

Arg Pro Asp Gly Val Trp Val Ala Ser Glu Gly Pro Glu Gly Asp 

208 

3280 3290 3300 3310 

I I I I 

CCT GCA GGA AAA GAA GCC GAG CCA GCC CAG CCT GTC TCT TCT TTG 

Pro Ala Gly Lys Glu Ala Glu Pro Ala Gin Pro Val Ser Ser Leu 

223 
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3320 3330 3340 3350 3360 

CTC GGC TCC CCC GCC TGC GGT CCC ATC AGA GCA GGC CTC GGT TGG 
Leu Gly ser Pro Ala Cys Gly Pro He Arg Ala Gly Leu Gly Trp 
238 

3370 3380 3390 3400 

III I 
GTA CGG GAC GGT CCT CGC TCG CAC CCC TAC AAT TTT CCT GCA GGC 

Val Arg Asp Gly Pro Arg Ser His Pro Tyr Asn Phe Pro Ala Gly 
253 

3410 3420 3430 3440 3450 

I I I I I 

TCG GGG GGC TCT ATT CTC CGC TCT TCC TCC ACC CCC GTG CAG GGC 
Ser Gly Gly Ser lie Leu Arg Ser Ser Ser Thr Pro Val Gin Gly 
268 

3460 3470 3480 3490 

I I i i 

ACG GTA CCG GTG GAC TTG GCA TCA AGG CAG GAA GAA GAG GAG CAG 
Thr Val Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin 
283 

3500 3510 3520 3530 3540 

I I I I { 

TCG CCC GAC TCC ACA GAG GAA GAA CCA GTG ACT CTC CCA AGG CGC 
Ser Pro Asp Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg 
298 



3550 3560 3570 3580 

I I I I 

ACC ACC AAT GAT GGA TTC CAC CTG TTA AAG GCA GGA GGG TCA TGC 
Thr Thr Asn Asp Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys 
313 

3590 3600 3610 3620 3630 

I I I I I 

TTT GCT CTA ATT TCA GGA ACT GCT AAC CAG GTA AAG TGC TAT CGC 
Phe Ala Leu He Ser Gly Thr Ala Asn Gin Val Lys Cys Tyr Arg 
328 

3640 3650 3660 3670 

III I 
TTT CGG GTG AAA AAG AAC CAT AGA CAT CGC TAC GAG AAC TGC ACC 

Phe Arg Val Lys Lys Asn His Arg His Arg Tyr Glu Asn Cys Thr 
343 
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3680 3690 3700 3710 3720 

ACC ACC TGG TTC ACA GTT GCT GAC AAC GOT GCT GAA AGA CAA GGA 
Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Ala Glu Arg Gin Gly 
358 

3730 3740 3750 3760 

111 I 
CAA GCA CAA ATA CTG ATC ACC TTT GGA TCG CCA AGT CAA AGG CAA 
Gin Ala Gin lie Leu He Thr Phe Gly Ser Pro Ser Gin Arg Gin 
373 

3770 3780 3790 3800 3810 

II i I * 

GAC TTT CTG AAA CAT GTA CCA CTA CCT CCT GGA ATG AAC ATT TCC 
Asp Phe Leu Lys His Val Pro Leu Pro Pro Gly MET Asn He Ser 
388 

3820 3830 3840 3850 

ill I 

GGC TTT ACA GCC AGC TTG GAC TTC TGA TCA CTG CCA TTG CCT TTT 

Gly Phe Thr Ala Ser Leu Asp Phe — Ser Leu Pro Leu Pro Phe 
403 
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